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Abstract 

The capacity region of the 2-user muhi-input multi-output (MIMO) Gaussian interference channel 
(IC) is characterized to within a constant gap that is independent of the signal-to-noise ratio (SNR) and 
^ ^ all channel parameters for the general case of the MIMO IC with an arbitrary number of antennas at 

^ each node. For a class of MIMO ICs characterized by a certain relationship between the numbers of 

■ antennas at the different nodes this gap is strictly smaller than the gap in a previous result obtained by 

' Telatar and Tse. For instance, the gap for the SIMO IC with single antenna transmitters and A^-antenna 

' receivers obtained here is 1 bit, instead of N bits. Moreover, in contrast to that previous work, a simple 

(N : 

, and an explicit achievable coding scheme are given here that have the constant-gap-to-capacity property 

i and in which the sub-rates of the common and private messages of each user are explicitly specified 

for each achievable rate pair The constant-gap-to-capacity results are thus proved in this work by first 
establishing explicit upper and lower bounds to the capacity region. A reciprocity result is also proved 
. . , which is that the capacity of the reciprocal MIMO IC is within a constant gap of the capacity region of 

. the forward MIMO IC 

Index Terms 
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I. Introduction 



The 2-user interference channel is a model for a single-hop, multi-flow wireless network in which mul- 
tiple transmit-receive pairs communicate over a common noisy channel in that it captures the fundamental 
interactions between the multiple transmitted signals in such networks, namely, broadcast, superposition 
and interference. This model was first mentioned in ||T1, and was studied in a series of works in ||2l, ||3], 
101, 121, in, n, im that considered certain special classes of the IC where the capacity regions of the 
so-called very strong IC, the strong IC and certain classes of degraded and deterministic ICs, respectively, 
were established. Different sets of inner and outer bounds considering the embedded multiple-access and 
broadcast and Z channels were derived in ||9], lH], lITOl . lITTI . |[T2l . |[T3l . However, the Han-Kobayashi 
(HK) (Hi coding scheme that improves that of the rate-splitting strategy of lITTI remains the best known 
achievable scheme for this channel. In spite of over 3 decades of research, the capacity region in the 
general case remained unsolved. 



Recent results include the simplified description of the HK rate region due to Chong et al. in ||T4l (see 
also |[T5l . nil) and the capacity regions of new and/or more general classes of channels than for which 
capacity was previously known, e.g., the sum capacity of the so-called noisy interference channels was 
found in lITTl . ifTSl . |fT9l and the capacity region of the very strong and aligned strong MIMO IC were 
found in ll20l . The common feature of this line of work is that it focuses on a small subset of channel 
parameters but seeks to solve the challenging problem of obtaining the exact capacity of the channel. 

A different line of research was initiated by Etkin et al. 11211 . where the authors find an approximation 
of the capacity region of the two-user scalar Gaussian IC where the criterion of approximation is to 
specify the capacity region to within a constant gap independently of SNR and the direct and cross 
channel coefficients. Moreover, they obtain that result through a simple HK scheme, i.e., by identifying a 
single, channel parameter dependent, joint distribution of input and auxiliary random variables among the 
infinitely many possible specifications including time-sharing that together contribute to the general HK 
rate region. The key feature of this simple HK scheme is that each user employs independent Gaussian 
superposition coding of private and public messages with the private message power set so that it reaches 
the unintended receiver at the noise level. A 1 bit gap to capacity was proved in 11211 using the simplified 
description of the HK rate region of |[T4l . Thus, the result of ||2TI characterizes the capacity region to 
within a constant gap that is independent of the SNR and all channel coefficients. Moreover, it identifies a 
simple HK scheme that has this property thereby also providing an explicit expression for the achievable 
rate region in terms of channel parameters. 
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Since most modern wireless communication systems feature multiple antennas at some or all terminals 
it is of interest to study the 2-user Gaussian MIMO IC. However, multiple antennas at different nodes 
make it harder to obtain results similar to those available for the SISO IC. For instance, the deterministic 
model developed in ll22l for the 2-user SISO IC which was shown to reproduce the constant gap to 
capacity approximation result of 11211 doesn't extend to MIMO channels. Moreover, as compared to the 
result on the capacity of the strong SISO IC, the capacity of the MIMO IC is known f20l only for the 
so-called aligned strong interference regime, where the direct and cross link channel matrices satisfy a 
matrix equation. Two important constraints involved in this result are (a) a covariance constraint on the 
inputs and (b) the direct link's channel matrix is a matrix multiple of the cross link's channel, where 
the multiple satisfies some particular constraint. The covariance constraint can be relaxed at the cost of 
making the expression for the capacity region much more complicated, but the matrix relation between 
the two channel matrices may seldom, if ever, hold. In general, the problem of characterizing the exact 
capacity of a MIMO IC even for small and special classes can be challenging; this point is also illustrated 
by Il23l where the capacity region of a class of very strong MISO ICs was characterized. 

In |[24l . Telatar and Tse consider an interesting class of two-user semi-deterministic discrete memoryless 
ICs which generalizes the class of deterministic ICs of ||8] and is also applicable to the Gaussian MIMO 
IC. They obtain outer bounds to the capacity region that are within a gap specified in terms of certain 
conditional mutual informations to the general HK achievable region |[T4l . The implication of this work 
to the 2-user MIMO IC is that the union of all the achievable atomic rate sub-regions of the general HK 
scheme (one sub-region for each input distribution), is within a constant gap (of Ni bits, where Ni is 
the number of antennas at receiver i) to the outer bound developed therein (which in turn is given as 
a union over all input and "time-sharing" distributions), and hence, to the capacity region. However, no 
specific achievable scheme is identified with the constant-gap-to-capacity property among the infinitely 
many possibiUties that make up the the general HK scheme. In fact, it is unclear from that work if there 
exists a simple HK scheme in general (corresponding to a single input distribution, as it does for the 
SISO case ||2T1 ) or even an explicit HK scheme (whose rate region is the union of rate regions achievable 
by a finite number of input distributions) with the constant-gap-to-capacity property. Moreover, since the 
upper and lower bounds are not given explicitly as functions of the channel matrices in ll24l they cannot 
be used for further analysis such as for example, for finding the generalized degrees of freedom (GDoF) 
analysis, as mentioned in |[25l . 
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In this paper, we consider the 2-user Gaussian MIMO IC with an arbitrary number of antennas at 
each node. Without restricting the channel matrices in any way, we obtain constant-gap-to-capacity 
characterizations through a simple and an explicit HK scheme, neither of which involves time-sharing. 
The approach we adopt is as follows: starting from a set of genie-aided strategies that are similar to, 
but are not the same as, those of Il24l (see Remark [B, we establish a set of explicit channel-matrices- 
dependent upper bounds to the capacity region of the 2-user MIMO IC under input power constraints, i.e., 
the resulting explicit outer bound on the capacity region does not involve a union over input distributions 
as does the outer bound in |[24]|. Consequently, inspired by a novel interpretation of this outer bound, we 
propose a simple HK coding scheme which involves independent Gaussian linear superposition coding 
with certain explicit channel dependent covariance matrix assignments for the private and public messages 
of each user and show that this input distribution produces a rate region that is within a constant gap to 
the capacity region. Moreover, the explicit bounds obtained here were used by the authors to obtain the 
GDoF region of the MIMO IC in the companion paper ll26l . 

The above specification of coding scheme does not conform to the specification of Il24l on the choice 
of the conditional distribution of the auxiliary random variables given the inputs. It is thus distinct from 
any achievable scheme that might result as a consequence of the prescription of (see Remark [8]l. 
Moreover, since in the HK coding scheme the public message of a user gets decoded at the receiver of 
the other user, it is important to choose the sub-rates of the private and public messages of each user 
carefully because an arbitrary rate for the public message might not be supported if the corresponding 
cross-link is weak. We thus also specify explicitly the set of these sub-rates for the private and public 
messages for which such a scenario never arises. In fact, a two-dimensional projection of this latter set 
actually yields the achievable rate region of the simple HK coding scheme. 

The gap to capacity of the aforementioned simple HK coding scheme is then improved by proposing 
an explicit HK scheme where the transmitters are allowed to use one of three simple superposition coding 
schemes depending on the operating rate pair. Interestingly, for a large class of MIMO ICs, this latter gap 
is smaller than the gap of Ni bits of |f24l . This class includes, for example, SIMO ICs (with single-antenna 
transmitters and multiple antenna receivers) for which the gap is 1 bit, instead of Ni bits. 

Using the explicit expressions for both the achievable rate region and the set of upper bounds to the 
capacity region of the MIMO IC, we then derive an interesting reciprocity result which is that the capacity 
of a 2-user MIMO IC is within a constant gap to that of the channel obtained by interchanging the roles 
of the transmitters and the receivers. 

The rest of the paper is organized as follows. Following a description of the notations used in this 
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paper in Section JIl we specify the system model. In Section |llll we derive a set of upper bounds 
to the capacity region and two different rate regions achievable by one simple and one explicit HK 
coding scheme. Comparing the set of upper and lower bounds, the capacity region of the MIMO IC 
is characterized within a constant number of bits. As a byproduct of this analysis, we also prove the 
reciprocity of the capacity region of the MIMO IC in the approximate capacity sense in Section IIII-DI 
Finally, Section |IV] concludes the paper. In order that the paper is easy to read, many of the proofs are 
given in the Appendices. 

Notations: Let C and M"*" represent the field of complex numbers and the set of non-negative 
real numbers, respectively. An n x m matrix with entries in C will be denoted as A £ £_nxm^ 
conjugate transpose of the matrix A is denoted as A^^ and its determinant as |^|. Let ||2;p represents 
the square of the absolute value of the complex number, i.e., if z = {x + iy) then = + y^. 
The trace of the matrix A G C"^" is denoted as Tx{A), i.e., Tx{A) = Y^^=i ^a- represents the n x n 
identity matrix, Omxn represents an all zero mx n matrix and U"^" represents the set oi nx n unitary 
matrices. The k^^ column of the matrix A will be denoted by A^^'^ whereas A^^^'-^'^^ represents a matrix 
whose columns are same as the k^^ to columns of matrix A. \A\ denotes the cardinahty of the 
set A. The fact that {A — B) is & positive semi-definite (p.s.d.) (or positive definite (p.d.)) matrix is 
denoted hy A^ B {ox A>- B). A (g) i? denotes the tensor or Kronecker product of the two matrices. If 
xt G C™^^, V 1 < t < n, then x" = [x\, ■ ■ ■ , Xn]^ {A, B, C, D} will represent an ordered set of matrices. 
Moreover, I{X; Y),I{X; Y\Z), h{X) and h{X\Y) represents the mutual information, conditional mutual 
information, differential entropy and conditional differential entropy of the random variable arguments, 
respectively. The quantities x A y, x V y and x~^ denote the minimum and maximum between x and y 
and the max{x, 0}, respectively. All the logarithms in this paper are with base 2. The distribution of 
a complex circularly symmetric Gaussian random vector with zero mean and covariance matrix Q is 
denoted as C7V(0, (5). ■ 

II. Channel Model and Mathematical preliminaries 

The 2-user MIMO IC is considered where transmitter i (Txi) has Mi antennas and receiver i {Rxi) 
has Ni antennas, respectively, for i = 1, 2. Such a MIMO IC will be referred to henceforth as the 
(Ml, A^i,M2,iV2) MIMO IC. Let the matrix Hij G C^^x*^- denote the channel between Txi and Rxj. 
We shall consider a time-invariant or fixed channel where the channel matrices remain fixed for the entire 
duration of communication. The {Mi,Ni,M2,N2) MIMO IC is depicted in Fig. [T] We also incorporate 
a real-valued attenuation factor, denoted as rjij, for the signal transmitted from Txi to receiver Rxj. At 
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time t, Txi chooses a vector Xa G C and sends y/PiXit over the channel, where we assume the 
following average input power constraint at Txi, 

1 " 

-VTr(Q,t) < 1, (1) 

for i G {1, 2}, where Qu = E(XjfX]j). Note that in the above power constraint Qit's can depend on the 
channel matrices. 

p Rxi(Ni) 

5; RX2(N2) 

Fig. 1: The (Mi, A^i, Ma, iVs) MIMO IC. 

The received signals at time t can be written as 

Yit = ^HiiXit + ^H2iX2t + Ziu (2) 

= i/P22-f^22-'^2t + \fPv2H12X1t + ^24, (3) 

where Zn G C^'^^ are i.i.d CAf{0, In^) across i and t, pu = rju^/Pi represents the signal-to-noise ratio 
(SNR) at receiver i and pij = r]ijy/P^ represents the interference-to-noise ratio (INR) at receiver j for 
i j ^ {1)2}. In what follows, the MIMO IC with channel matrices, SNRs and INRs as described 
above will be denoted by IC {V.,p), where V. = {Hii,Hi2, -f^2i, -^^22} and p = [pn, P12, ^21,^22]- The 
capacity region of IC {T-L, p) will be denoted by C {T-L, p) and is defined as follows. 

Let us assume that user i transmits information at a rate of Ri to Rxi using the codebook Cj^„ of n- 
length codewords with |Cj^„| = 2"^\ Given a message mj G {1, • • • , 2"^'}, the corresponding codeword 
X^{mi) G Ci^n must satisfy the power constraint given in equation ([T}- From the received signal YJ", 
the receiver obtains an estimate rhi of the transmitted message nii using a decoding function fi^n, i-C, 
fi,n{Yr) — ''^i- average probability of error be denoted by ei^„ = E (Pr (rfij 7^ mi j). 

A rate pair {Ri, R2) is achievable if there exists a family of codebooks 1 < i < 2}„ and decoding 
functions {/j,n(-)) 1 ^ ^ ^ 2}„ such that maxj{ej^„} goes to zero as the block length n goes to infinity. 
The capacity region C{'H, p) of IC {T-L, p) is defined as the closure of the set of achievable rate pairs. 



Txi(Mi) 



Hi] 



TX2(M2 
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Definition 1: An achievable rate region is said to be within rii bits of the capacity region if for any 
given rate pair {Ri,R2) G C{'H, p) the rate pair {{Ri — ni)^, {R2 — ^2)'*') hes in the achievable region. 



In this section, we shall characterize the capacity region of the 2-user MIMO IC to within a constant 
number of bits where the constant is independent of SNRs, INRs and the channel matrices. Such a 
characterization involves establishing a rate region and showing that no rate pair in the capacity region can 
be further from all the points in the achievable region by more than this constant. Such a characterization 
of the capacity region will sometimes be referred as the approximate capacity of the channel and the 
constant as the gap of approximation. A coding scheme which can achieve a rate region that is within 
a constant number of bits will be called an approximate capacity optimal (or constant-gap-to-capacity 
optimal) coding scheme. 

In what follows, we shall first obtain a set of explicit upper bounds to the capacity region in terms 
of the channel matrices. We then give an operational interpretation of these bounds which in turn helps 
us identify a particular input distribution and linear superposition scheme (by specifying the covariance 
matrices for the private and public message of each user) leading to a simple HK coding scheme. 
The achievable rate region of this coding scheme and the corresponding gap to approximate capacity is 



computed in Section IIII-BI Comparing these set of upper and lower bounds we prove that the two bounds 
are within ni bits of each other - thus proving the constant gap capacity result - where 

Ui = max{(mii log(Mi) + niij log(Mi + 1)) , min{A^i, M,} log(iV4)} + niji, for 1 < i / j < 2 (4) 

with = max{Mi, M2}, Ms = {Mi + M2), rriij = min{Mj, A^^}, and rhij = niij log (^^7^) • 



In Section |III-C[ an improvement is proposed by allowing the transmitters to select one of three 
carefully chosen superposition strategies depending on the rate pair to be achieved. It will be shown that 
the achievable region of this explicit HK coding scheme is within n* bits to the capacity region, where 



Note that on a SIMO IC, n* = 1. Finally, in Section ITlI-D[ we prove the constant gap reciprocity of the 
MIMO IC, i.e., the capacity of the 2-user MIMO IC does not change by more than a constant number 
of bits if the roles of the transmitters and receivers are interchanged. 



III. Capacity to within a Constant Gap 




n* = mm{Ni, MJ log(M^) + mji, for 1 < i / j < 2. 



(5) 
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A. A new outer bound to the capacity region 

The set of outer bounds to the capacity region for IC{'H, p), derived in this paper, will be denoted by 
7^"('H, p). For economy of notation, we define the matrices 

Ki = {Inu + PiMjHij) 1 < i / J < 2. (6) 

Lemma 1 (The Outer Bound): For a given % and p the capacity region, C(^, p) of a 2-user MIMO 
Gaussian IC, with input power constraint ([T}, is contained within the set of rate pairs Vy-{T-L,p), i.e., 

where Tl'^{'H,p) represents the set of rate pairs {Ri,R2), satisfying the following constraints: 

iii < logdet(/;v, + PuHuHI) ; (V) 
i?2 < log det (/tv. + P22H22HI^ ; (8) 
Ri + R2< log det (In, + PuHuHI^ + P22H22HI^ + log det (In, + piiHiiKiH\^ ; (9) 



Ri + R2< log det [In, + P2i^2i^li + PiiHiiH\^ + log det ( In, + P22-?^22-?^2^^22 ) ; (10) 
Ri + R2< logdet(^/7Vi + P2i^2i^li + PuHuKiHI^ 

+ log det (In, + PuHuHI^ + P22H22K2HI2 ) ; (11) 



2Ri + R2< logdet(/iVi + P2iH2iHI^ + puHuHl^j + log det (^In, + puHuKiHl^j + 

log det (In, + P12II12HI2 + P22H22K2HI^ ■ (12) 



Ri + 2R2 < log det (/tv, + P12H12HI2 + P22H22Hl^j + log det (j^v, + P22H22K2H\^^ + 

log det (/,v, + P21H21HI, + puHuKiHl) . (13) 



Proof of Lemma \T} The proof is given in Appendix |A] ■ 
In the following three remarks we describe how the above lemma is distinct from related results in 
the literature in llITI. Il24l. Il27l. 

Remark 1: There are two major differences between the set of upper bounds given in ll24l and those 
provided in Lemma [D 1) The genie aided signal or side information provided at the different output 
nodes are different from those of Il24l . For instance, to derive the first three sum-rate upper bounds we 
provide Rxi with the interference plus noise received at Rxj, i.e., 5" = (/„ ® Hij) + as the genie 
signal whereas the corresponding genie signal of Il24l is some signal which has the same distribution as 
5" but is conditionally independent of 5" given X^; and 2) the set of upper bounds of 1241 are specified 
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as a function of input (and time-sharing) distributions whereas the bounds of Lemma [T] are expUcit and 
vaUd for any 2-user MIMO IC with the input power constraints in ([T]). 

Remark 2: It was mentioned in Il24l that obtaining the approximate capacity results for the MIMO 
case starting from the result of 11211 appears difficult. It seems that the difficulty lies in deriving a 
tight upper bound for the capacity region in the case when either Mi > Ni or Af, > Nj^i. For example, 
consider the first sum rate upper bound in [|2n . where the second user's codeword is given to the receiver 
of the first user. Using Fano's inequality for the MIMO channel we have 

n{Ri + R2) <I{Xl';Y{') + I{X1^-Y;') + nen, [here, e„ ^ as n ^ 00] 
=h{YnXl^) - h{Z^) + h{Y^) - h{Y^\Xl^) + ne„, 

which simplifies to 

n{Ri + R2) < h{^{In » Hu)X^ + Z^) - h{^{In ® H^2)X^ + ^2 ) + KY2I " KZi) + ne„. 

Now h{Y2) in the above equation can be easily upper bounded by Gaussian inputs and upper bounding 
the difference between the first two differential entropies is relatively easy when both Hu and H12 are 
square. A similar concept can be extended to the case when both of these matrices have a larger row 
dimension than the column dimension, using the singular value decomposition (SVD) of these matrices. 
In fact, this approach was used in |[28l to extend just the generahzed degrees of freedom result for SISO 
Gaussian IC of lEH to the class of symmetric (M, N, M, N) MIMO ICs with N > M. However, it 
can not be applied when Mi > Ni or Mi > N2 because it is not clear how to upper bound the above 
difference in such a way that it is tight enough to yield a constant gap to capacity or even tight in the 
weaker sense of generalized degrees of freedom. In the genie-based model, used in the proof in Appendix 
lAl this problem does not arise. 

Remark 3: Two sets of explicit upper bounds to the capacity region of the MIMO IC, denoted therein 
respectively as 1Zq{H, G) and 1Zqq{H, G), were derived in ||27| from the result in ll24l . It can be easily 
verified that the first four bounds in Lemma [T] are identical to those in ll27l and the fifth bound (on 
Ri + R2) can be shown to be equivalent to the 7*^ bound of 1Zq{H, G) and TZqq{H, G). However, the 
bounds on {2Ri + R2) and {Ri + 2R2) in |[27l are incorrect. Figure |2] illustrates this fact by showing 
that on the SISO IC specified in Example [U the bound on {2Ri + R2) in TZqq{H,G) contradicts the 
achievability of some of the rate pairs. We elaborate this point further. 
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Consider the bound on {2Ri + R2) in Tloo{H, G), which for the SISO channel, can be written as 



(2iii+i?2) <log(l + P||Fnr + P||^2ir)+log( 1+ ^^^,,2 



211 

2 , _P||j^22]P 
1 + P||^12| 



+iog( i + p||ff2if + , , „„;; 112 1 ■ (14) 



By the notation of llB, SNRj = PiWHuW^ and INRj = |p for i / j G {1, 2}. Using this notation 

in the above equation we get 

SNRi 
1 + INR2 



{2Ri + R2) < log(l + SNRi + INRi)+log 1 + 



The corresponding bound in the achievable region for the weak SISO IC (INRi < SNR2 and E^R2 < 
SNRi) derived in 121] is given as (e.g., see Theorem 5, equation (61) in [21J) 

SNRA 



{2Ri + ^2) < log(l + SNRi + INRi)+log 2 + 



INR2 / 



SNR2\ 
INRi 



+log ( 1 + INR2 + ^ j - 3 ^ J^. (16) 



Comparing the two bounds in equation ([TST i and ([T6] l we see 

< bEE^ + log (1 + INRi) - log (1 + INR2) + 3, 

which implies that if INRi is sufficiently smaller than INR2, i.e., log (1 + INRi) < log (1 + INR2) then 
the upper bound in ([TSl l can be strictly smaller than the lower bound ([T6l ). Suppose there exists an IC on 
which, in addition to the fact that B^'^ < Bf^^, the bound ([T6l ) is active. This would imply that there 
exists achievable rate pairs (i?i, R2) which satisfy equation (fT6l) but violate (fTSl ) since Bff^ < Bf^^. 
Now, (i?i,i?2) is an achievable rate pair and can not violate an upper bound unless it is incorrect. The 
following example proves the existence of such a channel. 

Example 1: Consider a real SISO IC with Hu = 45, Hu = 25, H21 = 3, H22 = 30 and Pi = P2 = 
P = 1. For this channel we have SNRi = 2025 > INR2 = 625 and SNR2 = 900 > INR2 = 9. Clearly, 
this is a weak interference channel (by the definition of (21]), for which an achievable rate region can 
be easily computed using equation (61) in Theorem 5 of ^2T\ and is depicted in Fig. |2] as the polygon 
ABCDEF. On the other hand, putting the values of SNR^'s and INR/s in equation (fTSl ) we get the upper 
bound on (2i?i + R2) from ||27] . which is also depicted in the figure. According to this bound any point 
(e.g., point S in the figure), above the line segment PQ is not achievable, which evidently is not true. 
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(In bpcu) 



Fig. 2: Comparison of an achievable rate region and the upper bound on {2Ri + R2) from ll27l . 

B. A new inner bound on the capacity region via a simple achievable scheme 

The achievable region of ll24l . which is specified as a union of sub-regions over all possible input 
distributions and time-sharing schemes, was shown to be within a constant gap (of A'^; bits) of the capacity 
region. It is not clear from that result as to whether a coding scheme coixesponding to a single input 
distribution, or one that corresponds to time sharing between a few carefully chosen input distributions, 
can achieve the capacity of the Gaussian MIMO IC within a constant number of bits. In this section, we 
develop a simple HK coding scheme corresponding to a single joint distribution of input and auxiliary 
random variables - that does not belong to the class of distributions that would be consistent with the 
prescription of |(24] - but that nevertheless has the desirable property of having a rate region that is within 
a constant gap to the outer bound of Lemma [T] and hence to the capacity region. First, in Section IIII-B 1 1 
we briefly review the original HK coding scheme ||4l and some recent developments in |[T5l . |[T4l for the 
discrete memoryless interference channel (DM-IC) and then apply those results to the Gaussian MIMO 
IC. In Section IIII-B 21 we give a novel interpretation for the outer bound of Lemma [T] which is used as 
the basis for the specification of a simple achievable scheme in Section IIII-B 3 1 The description of the 
simple achievable scheme includes not only the specification of the joint distribution of the input and 
auxiliary random variables but also the rate split between private and common sub-messages and this is 
detailed in Section IIII-B4I An inner bound on the achievable rate region of the simple HK scheme of 
this section is then obtained in Section IIII-B5I This inner bound is seen to resemble the outer bound of 
Lemma [T] from which the constant gap to capacity result is easily deduced. 
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1) A review of HK achievable region and related work: A concise description of known results in 
the literature on the achievable regions of the IC is given here which will later be used to connect and 
contrast them with the results of this work. 

On a DM-IC with transition probability P{Yi,Y2\Xi, X2), for any set V* of probability distributions 
P* which factors as 

P*{Q,Ui,U2,Wi,W2,XuX2) =P{Q)P{Ui\Q)P{U2\Q)P{Wi\Q)P{W2\Q) 

P{Xi\Ui,Wi,Q)P{X2\U2,W2,Q), (17) 

where P{Xi\Ui,Wi,Q) and P(X2|[/2, 1^2, Q) equals either or 1, let 

represents a set of sub-rate 4-tuples, where 

n^°i\P*) = {(ri„,ri^,r2„,r2^) : <I{Uf,Y,\W^,Wj,Q) ^ (18a) 

<I{W^;Y^\Ui,W,,Q) ^ 4; (18b) 

Tjn, <I{Wj■,Y^\U^, ly^, Q) ^ /c,; (18c) 

{nu + n^) <im^Wi-Yi\W,,Q) ^ (18d) 

{Tin + r,^) <I{U^, Wj;Yim, Q) ^ 4.; (18e) 

{ri^ + <I{W^ , Wj ■,Yi\U^,Q)^If^; (1 8f) 

{r^u + + r,«,) <I{U^, Wi, Wj-Yi\Q) ^ /g,} (18g) 

for i ^ j ^ {1, 2}. Further, for a set S of 4-tuples (ri„, ri^, r2u, T2w), let n(5) = {{Ri, R2) '■ < Ri ^ 
{fiu + riw),l < i < 2, for some (ri^j, ri^, r2u, r2iu) G 5}. 11(5) is hence a particular 2-dimensional 
projection of 5. Then from H we have the following theorem. 
Theorem 1 (H): The set 

^HK = n( U ^&k(^*)) (19) 

is an achievable region for the DM-IC. 

Remark 4: Let Ui {U2) and Wi {W2) represent the private and common parts of the message to 
be transmitted by Txi {Tx2), which hereafter will be referred to as the private and common message 
of Txi (TX2), respectively. Also, let r^u and represent the rates of information carried by Ui and 
Wi, respectively, for i G {1,2} and let Xi be constructed from Ui and Wi in such a manner that the 
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joint distribution P* {Q,Ui,Wi,U2,W2, Xi, X2) GV*. Then, Theorem [J] essentially states that, for any 
{riu,riu,,r2u,r2w) G ■^hk(-P*) the rate pair (rin + riw,r2u + r2w) is achievable on the DM-IC. 

Thus, for any given P*, Theorem [T] not only provides a set of achievable rate pairs of the channel in 
the form of H (7^hk(^*))' also provides the set of 4-tuples from which the rates of their private and 
public messages can be determined. However, to determine the achievable rate region of the channel, 
it is necessary to obtain the auxiliary sets 'R-^^^^P*) and 'Tl^^\P*) first. This indirect method can be 
avoided by using the equivalent description, denoted as 7^HK(-f*)' of n (7?-hj^(P*)~iI that was obtained 
by Chong et al. in Lemma 1 of |[T4l . stated below for easy reference. 

Lemma 2 (Lemma 1 in EJ): For a fixed P* G V*, let ■7^^K(-P*) be the set of rate pairs (-Ri,i?2) 
satisfying: 



Ri <I{Xi 


Yi\W2,Q); 




(20a) 


Ri <I{Xi 


Yi\Wi,W2^Q) + I{Wi;Y2\X2,Q)-^ 




(20b) 


R2 <I{X2 


Y2\Wi,Q); 




(20c) 


R2 <I{X2 


Y2\Wi,W2,Q) + I{W2-^Yi\Xi,Q)- 




(20d) 


Ri + Ra <I{X2 


Wi-Y2\Q) + I{Xi-Yi\WuW2,Q); 




(20e) 


Ri + R2 <I{Xi 


W2;Yi\Q) + I{X2\Y2\WuW2.Q)- 




(20f) 


Ri + R2 <I{Xi 


W2;Yi\Wu Q) + IiX2, Wi-Y2\W2, Q); 




(20g) 


2Ri + R2 <I{Xi 


W2]Yi\Q) + I{Xi-Yi\Wi,W2,Q) + I{X2 


Wl-Y2\W2,Q)\ 


(20h) 


Ri + 2R2 <I{X2 


Wi-Y2\Q) + I{X2;Y2\WuW2.Q) + I{Xi 


W2;Yi\WuQ). 


(20i) 



The Han-Kobayashi achievable region is given by T^hk — Up*gp*'7^hk(^*)- 

Note that the achievable rate region 7^hk(^*) i"^ Lemma |2] is now specified directly as a set of rate 
pairs (i?i,i?2) defined through constraints (|20al l- (|20il l. 

The set of rate pairs R2) constrained by all the bounds of equation (l20l i except (I20bl ) and (I20dl ) 
was defined and called the "compact rate region" in [14|. In the rest of this paper, we shall denote this 
set with input distribution P* by 7^^K(-P*)• Clearly, 1Z1^^{P*) C W^^iP*) and it can be shown that 
there exists channels P{Yi,Y2\Xi,X2) such that 7^^^K(-P*) / ^&k(-P*) (e-g-> see Fig. [6(5)] ). 



'We use the superscript "o" to refer to the original HK coding scheme O and "e" to emphasize that 7Jhk(^*) is an equivalent 
description of \l{7i'^Y^{P*)). 
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Remark 5: Lemma |2] was proved by showing that for any given P* G "P*, 

TZ^^iP*) = n (7^SK(^*)) = n {n^°i\p*) n 4f . (2i) 

An equivalent description for 11 (7^hk(^*)) was derived earlier in lITSl using the Fourier-Motzkin elimi- 
nation method on the set of constraints given in equation ([TSl l which have two additional constraints on 
(2i?i + R2) and {Ri + 2R2) besides those in Lemma |2] Later, in |[T4l these bounds were shown to be 
redundant resulting in Lemma |2] 

The above discussion is summarized in the following schematic diagram. 

n'ii:\p*)r\n'ii\p*) — ^(7^^iK(P*)) =7^f^K(^'*) — - iJp^ev^nt,^{P*) 

^ II 

^&k(^*) Up.ep.7^S^K(^*) 

Remark 6: Recently, an alternative proof of Theorem 2 of [[T4] was given in |[29ll . That proof is 
based on the fact that the rate region lZ{a{Pi) is achievable by a single input distribution of the form 

Pt{Wi,Xi, W2,X2,Q) = P{Q)P{Xi\Q)P{X2\Q)P{Wi\Xi,Q)P{W2\X2,Q), where TZUP^) is the 
two-dimensional projection, i.e., 

T^miPi) = {{R11R2) ■■ Ri = {nu + riw), and (ri„, ri^, rs^., rs^) G "/^-^^(A*)} > 

of the set of 4-tuples, denoted as t6^\p^), defined as (see equations (224)-(233) of US) 

T^t\Pl) = [{riu,nw,r2u,r2^) : <I{X,-Y,\Wi,Wj,Q)- 

{Tin + Tin,) <I{Xi;Y,\Wj ,Q)] 

[viu + rju,) <I{Ui,WJ■Yi\Wi,Q)■ 
{r^u + + +rj^) <I{X„ Wj;Yi\Q) for i = 1, 2} . 

The expression for Tl\n{P^) is denoted as TZcmg{P*) in Lemma 4 of |[T4l . and was computed in 
Theorem D of ifTSl . The expression for 7^cmg(^i ) has two additional constraints than those that define 
■^HK(-fr) (e-g-' see Lemma 4 of HI or Theorem D in [15]). In other words, 7^in(Pl*) / T^hkI^i) for 
all inputs of the form P*. However, it was proved in OH that Up*gp*7?,in(Pi ) = Up'^-p'TZ^^^P^). This 
result can be pictorially represented as shown in the following diagram. 
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n(.) 



"^CMG (-PT) 



Up*gP'7^CMG(-Pl ) 



7?^ 



•HK 



UprePr^&K(^i*) 



Using standard techniques (of. Chapter 7 of |]30|) these discrete-alphabet results can be applied to 
the Gaussian IC with continuous alphabets. To distinguish them from each other, the rate regions 



corresponding to TZ'^^{P*), 7^^°K^(P*), 7^f^K(-P*) and TZf^^iP*) in the Gaussian IC will be denoted 
as n^iiP*), 7^^^K°''^(P*), n^iiP*) and 7^g^(P*), respectively. 



Evidently, both the original description of Theorem [T] and the alternative description of Lemma |2] of the 
HK coding scheme are given as a union of an infinite number of atomic sub-regions, each corresponding 
to a particular input distribution and time sharing strategy. Since a complete characterization of this region 
is prohibitively complicated, we seek in some sense a single good input distribution and time sharing 
strategy. Indeed, we provide a novel and important operational interpretation of the bounds of Lemma [T] 
through which such a good choice of input distribution becomes apparent, leading to a simple HK coding 
scheme. Moreover, this simple HK coding scheme has a property of being universally good in that it 
achieves a rate region that is within a constant number of bits to the set of upper bounds of Lemma [T] 
independently of SNR and the channel parameters. 

2) An interpretation of the outer bound of Lemma\I} The first two bounds in TZ^CH, p) come from the 
rate bound on a point-to-point channel. The first term of the third bound given in ^ represents the sum 
rate upper bound of a 2-user multiple-access channel (MAC) having channel matrices Hi2, for i = 1,2 
and Gaussian input with zero mean and scaled identity matrix as covariance. The second term represents 
the mutual information on a point-to-point channel whose input covariance matrix is Ki (see Q for the 
definition of Ki). These terms can be given the following operational interpretation. The entire message 
of Tx2 has to be decoded at Rx2 and some part of Txi might be decoded at Rx2. Let us call this 
the public message of the first user, denoted as Wi having rate Riw Subsequently, let us denote the 
remaining part of the first user's message by Ui having rate Riu which will be referred to as the private 
message of the first user. Thus we have Ri = Ri^ + Now, with respect to Wi and X2, Rx2 acts 
as a MAC and thus has the following upper bound 



Riw + R2< log det In.2 + P12H12H 



12 
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On the other hand, since Ui has to be decoded at Rxi, it has the following point-to-point channel upper 
bound 

Riu < logdet (^In, + piiHiiKiH\^ , 

where Ki is the covariance matrix of Ui . These two bounds together imply the third bound in Lemma [T] 
The 4*^' bound can also be interpreted similarly just by interchanging the role of transmitters. The first 
term of the fifth bound can be thought as a bound on the private message of Txi and the public message 
of Tx2 which are to be decoded at Rxi, i.e., 

Riu + R2w < logdet (Jn, + ^21-^21^^21 + PuHiiKiHI^^ , 

where the private message has the same covariance matrix as before. Similarly, the second term in the 5*'' 
bound can be interpreted as an upper bound on {Riw+R2u), and together, they imply the fifth bound. The 
other terms of the remaining bounds can be similarly interpreted. This interpretation motivates a simple 
HK scheme, where Q is a deterministic number (no time-sharing), the i*^' user's message is divided into 
a private and a public message and the private message has an input covariance matrix proportional to 
Ki. 

3 ) The simple HK coding scheme: The interpretation of the outer bound in the previous section inspires 
the following simple coding scheme. 

Definition 2 (Input distribution for the simple coding scheme): Let the private and pubUc mes- 
sages of the users be encoded using mutually independent random Gaussian codewords and the overall 
codeword is a linear or additive superposition of the two, i.e., the transmit signals for any particular 
channel use can be written as 

^ ^ ^ (22) 
XI = Ui + wl 

where Uf ~ CAA(0, Kiu) and Wf ~ CM{0, Kiw), represent symbols of the codewords of the private and 
public messages of user i, respectively and 

K,^in) ^ E{W[Wf^) ~ {hu - Ki) . (24) 

The scaling by -p- is required to satisfy the power constraint (H). In the sequel, we shall refer to such a 
superposition coding scheme where the covariance matrices of the private and public messages of user i 
are given by Kiu and Ki^ will be referred to as the T-LK, {{Kiu, Ki^, Kiu, Ki^}) scheme. In particular. 
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when Kiu and Ki^ are as in equation (|23] | and (|24] |. respectively the coding scheme will be denoted as 
HKS^^ (the superscript s stands for simple). Let us denote the distribution of the random variables as 
defined above by PsiUf, Wf, X^, U^, VF|, X2). Clearly, P,(C/f , Wf,Xf, Ul, W^, Xf) G V*. 




Fig. 3: The equivalent virtual channel for the simple HK coding scheme. 

Remark 7: The above choice ensures that the private message of user i, the covariance of the 
contribution of which at Rxj (namely, .JpijHijUf), is given by 

^ I Mi, 

and thus reaches the unintended receiver below the noise floor. Thus the simple achievable scheme here 
when specialized to the SISO IC embodies the key principle in the achievable scheme of |2ll. It is of 
course applicable much more generally to MIMO ICs and cannot be as such inferred in its general form 
from just that principle alone. Moreover, we will also soon see that even when specialized to the SISO 
IC the details of the power split between the public and private messages resulting from this work are 
different from that in fl\\. 

The T-LIC {{Kiu, Kiw, K2u, K2w}) coding scheme thus effectively divides each user into two virtual 
users as shown in Fig. |3] Note that the interference links from the first virtual user to Rx2 and the fourth 
virtual user to Rxi are made very weak so that any signal along those links always reaches the receivers 
below noise floor. As shown in the figure, the channel can be thought as two interfering MACs where 
Rxi jointly decodes Uf, Wf and W?^-, treating Uj as noise for 1 < f 7^ j < 2. 
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Applying Theorem [T] and Lemma |2] for the Gaussian IC and evaluating it for the distribution, Ps{-) of 
Definition |2] we get the following achievable region for the 2-user MIMO Gaussian IC. 

Lemma 3: On a 2-user Gaussian MIMO IC, the simple T-LIC {{Kiu, Ki^,K2u, -f^2to}) coding scheme 
can achieve the rate region, TZ^^{Ps), which is a set of rate pairs R2) where Ri's satisfy the following 
constraints 



i?i <IiXf;Yf\Wiy, 

R, <IiXf-Yf\Wf,Wi)+IiWf;Yi\Xly, 
R2 <IiXl;Yi\Wfy, 

R2 <iixi-Y,^\wf,w^)+iiwl,Yf\xfy, 

Ri + R2 <I{Xl Wf-Yi) + /(Xf ; Yf\Wf, W^y 

Ri + R2 <i{xf, wi,Yf) + 1 {XI- Y3\wf, w^y 

R^ + R2 </(Xf , Wi;Yf\Wf) + Wf;Yi\W^y 
2Ri + R2 <I{Xl Wl-Yf) + I{Xl- Yl\W(, Wl) + I {XI, Wf;Yi\W^y 
Ri + 2R2 <I{Xl, Wf;Yi) + I {XI; Y^'\Wf, W^) + I{Xf, Wl,Yf\Wf), 



(25a) 
(25b) 
(25c) 
(25d) 
(25e) 
(25f) 
(25g) 
(25h) 
(25i) 



where Yf's are the outputs of the 2-user MIMO IC when its inputs are Gaussian as stated in Definition [H 
Further, 

n^i{Ps) = n (n^i{Ps)) = n (n'^^-'\Ps) n ^^-'^(p. 

where 



T^hk'Hp*) = {riu,n^,r2u,r2^) : <I{Xf;Yf\Wf,W^y 



Tin, <I{Wf;Yf\Uf,W^'y 
rj^ <I{W^';Yf\Wf,U^y 
{r^u + r^^) <I{Xf;Yf\W^y 
{r^u + r,^) <I{Uf,W^';Yf\Wfy 
{r^^ + <I{Wf, W^;Yf\U^y 
)<I{U^,W[,W^;Yf)] 



(26a) 
(26b) 
(26c) 
(26d) 
(26e) 
(26f) 
(26g) 
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for i j £ {1,2} and 

I{Xf- Yf\Wf, Wl) =log det (In, + piiHuKiuHl, + P2i^2ii^2.i^li) - rsi; (27) 

I{Wf, ■ Yf\Wl Uf) =log det (ItVi + piiHuKi^Hl + /92iF2ii^2«^li) - r2i; (28) 

/(t^l; y/|Xf ) =log det I^In, + f|^2ii^ii) - ^2i; (29) 

/(Xf ; y/|T^|) =log det (^In, + f^^n^ii + /02li^2li^2^.^|l) - rai; (30) 

I(Xf , Wl, Yf\Wf) =log det (^/jv, + f|^2ii^ii + PuHuKi^hI^ - T21; (31) 

/(VFf , |C/f ) =log det (^In, + f|^2ii^li + puHuKiy^Hl^ - T21; (32) 

I{Xf, Wl- Yf) =log det (^In, + ^^H2iHl, + g-ifni^Ji) - r^i, (33) 

= logdetilN,+ PijHijK^uHl) for i / j G {1,2} and I{Xl;Y^^\Wi ,Wf) through /(X|,t^f;y/) 
are obtained by swapping the indexes 1 and 2 in the above set of equations, where Kiu and Ki^ are 
given by equation (1231 ) and (l24l ). respectively for 1 < i < 2. 

Proof: Equations (|25] ) and (l26l ) result from a simple application of the DM-IC result of Lemma |2] 
and Theorem [U to the Gaussian channel. Equations (|27])-(|33]) are obtained by evaluating the different 
mutual information terms in equations ([TSl l for the given distribution of Uf, C/f > > -^1 ^^'^ ^2 
in Definition m ■ 
Remark 8: In this remark we substantiate the claim that the simple coding scheme of Definition |2] does 
not fall into the set of admissible coding schemes of ll24l the union of whose rate regions constitutes 
the achievable rate region therein. In particular, suppose we consider the input to be distributed as 
Gaussian, i.e., Xi ~ CM{0, Sj) where Sj satisfies the power constraint ([1} for both i = 1, 2. Then 
according to the coding scheme of ll24l the random variable Wi (note this would be denoted as Ui 
in ll24l ) which corresponds to the common message of the i-th user, is conditionally independent of, 
but identically distributed as. Si given Xi. This in turn implies that the conditional distribution of Wi 
is CJ\f{^/pijHijXi, In.) and its marginal distribution is hence CM (^0, {I + pijHij'EiHjj)^ . Note that, 
depending on the channel matrices, the covariance matrix of Wi can itself be larger (in partial order) than 
the covariance of the input Xi (e.g., when Hij = I). Therefore, Wi cannot be a signal that can propagate 
through the channel. In the admissible coding schemes of ll24l therefore, unlike in the coding scheme 
of Definition [2l Wi serves as an auxiliary random variable which is used to generate the transmitted 
codeword for the user. In other words, the prescription of ll24l can not lead to any explicit additive 
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superposition coding scheme specified by Definition |2l 

4 ) The rate splitting strategy for the simple HK coding scheme: Note that in the simple HK coding 
scheme, each user has 2 messages: a private message Uf, which is to be decoded at its own receiver, 
and a common message Wf, which is to be decoded at both the receivers. Therefore, to achieve a rate 
pair R2) € ^hk(^s) rates of these messages have to be chosen in such a way that each can be 
decoded at their respective receivers with arbitrarily small probability of error and 



{riu + rj^) = Ri, y i G {1, 2}, 



(34) 



where r^^ and rj^ are the rates of the private and public messages of user i, respectively. The second 



part of Lemma |3] provides such a set (namely, Tl^^{Ps)) from which these sub-rates can be chosen 



Since, TZ^KiPs) = U[n'^^{Ps)), for every (i?i,i?2) G TZ'^iiPs), by the definition of n(.), there exists 



at least one 4-tuple {riu,riw,r2u, 'r'2w) G ^hk(^s) such that {riu + Vi^) = Ri for both i = 1,2. On the 



other hand, by Theorem [T] for any {riu,riw,r2u,r2w) G T^HKi^s), if and r^^ represent the rates of 
information carried by Uf and Wf, respectively, then the simple HK scheme can achieve the rate pair 
{riu+riw, r2u+r2w) (recall Remark|4]), i.e., C/f , Wf and Wj can be decoded at Rxi with arbitrarily small 
probabiUty of error, for i j e {1, 2}. So, the rate splitting strategy of the 'H)C{{Kiu, Ki^,K2u, K2w}) 
scheme can be summarized as follows. 

For any R2) G TZ^^{Ps) and the simple HK coding scheme of Definition[2l choose (ri^, ri^, r2u, f'2w) 
from the atomic sub-region 7^^^(Ps) in such a way that (r^u + Viyj) = Ri (the existence of which is 
now guaranteed by Lemma |3]l, assign rate riu to the private message Uf and to the public message 
Wf of user i and then transmit the signals using the additive superposition coding scheme specified by 
Definition |2l On the decoding side, Rxi can jointly decode Uf , Wf and treating C/J as noise for 
i / j G {1, 2}, with vanishing probability of error (e.g., see Theorem [1}. 

Example 2: Consider a 2-user Gaussian (2, 3, 2, 2) IC with p = [20, 8, 12, 20] dB, where the channel 
matrices are given as follows 



H 



11 



H 



12 



L1975 


- 0.4385i 


-0.0902 + 0.1895i 


0.3234 


- 1.3614f 


0.1330 - 0.2564i 


0.7546 


- l.OOSOi 


-0.3205 - 0.6958i 


0.9652 


- 0.8085? 


-0.3033 + 0.0055i 


0.6130 + 1.4479i 


0.6872 + 0.5280i 



21 



22 



0.3816 - 0.8508i 
-0.4892 - 0.2179i 
0.7665 - 1.0875i 

-0.1209 -0.4575Z 
-0.5730 + 1.1118i 



0.4450 - 0.4386i 
-0.5346 - 0.1519* 
0.1689 + 0.765H 

-0.0040 + 0.0921i 
-0.8223 - 0.5687i 



In Fig. |4] the dotted line represents the rate region achievable by the simple HK scheme and the solid 
line represents the superset Tl"-(T-L,p) which contains the capacity region of the channel. 
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6 



(in bpcu) 

Fig. 4: An achievable rate region of the simple HK scheme. 

5) The constant gap result for the simple HK scheme: It is not unreasonable to imagine that the 
gap between the boundaries of the achievable rate region and the set 7^" [T-L , p) can behave arbitrarily, 
including becoming unbounded sometimes, as a function of the channel matrices. However, in what 
follows we shall show that this gap actually remains bounded and can not be larger than a constant 
which is independent of the SNR, INR or the channel coefficients. This fact will be proved by showing 
that 7^hk(^«) contains a subset which is within a constant number of bits to the set of upper bounds. 
The following lemma specifies this subset. 

Lemma 4: The achievable rate region of the simple TilC {{Kiu, Ki^, K2u, K2w}) coding scheme 
employed on IC{'H, p), contains the region Tla{'H, p), which is a set of non-negative rate pairs satisfying 
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the following constraints: 

Ri < (logdet(/jv, - ni)^; (35) 

R2 < ( log det (/^, + p22H22HI^ - na) ^; (36) 

Ri + R2<[ log det (In, + pi2Hi2Hl^ + P22H22HI^ (37) 

+ log det (/tVi + - (ni + 112))^; (38) 

+ < ( log det [In, + P2iH2iHI^ + piiHuH\^ (39) 

+ log det {In, + P22H22K2HI^ - (ni + na)) ^; (40) 

Ri + R2<[ log det [In, + P2iH2iHI^ + piiHuKiHl^^ (41) 

+ log det (^In, + Pi2Hi2HI^ + P22H22K2HI^ - (m + n2)) ^; (42) 
2Ri + R2< (log det (/^, + p2iH2iHl^ + puHuHI^ + log det (/;vi + PhHuK^hI^ + (43) 

+ log det (/jv, + P12H12HI2 + P22H22K2HI^ - (2ni + na)) ^; (44) 

^1 + 2ii2 < ( log det (/jv, + Pl2Hl2Hl^ + P22H22HI^ + log det (Itv, + P22H22K2HI^ (45) 

+ logdet(/jv, + P2iH2iHl-^ + piiHuKiHlA - (m + 2713))^, (46) 



where J^j's are as specified before (see equation (l6]l) and nj's are given by equation dD for 1 < i < 2. 

Proof: The proof is given in Appendix |C] It is shown there that Tia{T~L, p) describes a polygon which 
is completely inside 7^^^(Ps), and is hence achievable by 'HK,^^\ The idea is to show that the bounds in 
(l35])-(l46l) are obtained by replacing the right hand sides of (I25al )- (l25il ) by their respective lower bounds 
(which in turn are the right hand sides of the bounds describing TZa{T-L,p) in this lemma). Appendix [Cl 
establishes these lower bounds. 

■ 

Note that each bound of Lemma |4] differs from the corresponding bound in Lemma [U only by a 
constant, from which we get the following constant gap to capacity result. 

Theorem 2: The rate region TZa {T~i,p) of Lemma |4l which is achievable by the simple HK scheme 
T-LK, {{Kiu, Kiw, K2u, K2w}), is within Ui bits to the capacity region of the Gaussian MIMO IC, where 
Ui is given by equation 

Proof: We need to prove that for any given {Ri, R2) G C {T-L, p), there exists a rate pair {Ri, R2) G 
T^a {T~L, p) such that Ri > Ri — Ui for 1 < i < 2, or equivalently, — ni)+, {R2 — J^a)"*") G '^a p)- 
This can be proved using Lemma [T] and |4] as follows. The proof is by contradiction. Using Lemma [1] 



January 11, 2013 



DRAFT 



23 

we have 

{Ri,R2)eC{n,p) ^{Ri,R2)(^lV'{n,p). 

Now, denoting Ri = {Ri — ni)~^ for i = 1,2, let us assume that {Ri,R2) ^ T^a{'H,p). This implies 
that one or more of the bounds of Lemma |4] are not satisfied by the rate pair (Ri,R2)- Without loss of 
generality, we assume that Ri > Ui, Mi because the other case follows trivially and the S^'^ bound is 
not satisfied, i.e., 

+ R2) ={Ri + R2- (ni + ns)) , 

> logdet [In^ + P12H12HI2 + P22H22HI2 



+ logdet [In, + piiHiiKiH\^j - (ni + 712) 
{Ri + R2) > logdet (In, + PuHuHI^ + ^22^22^2^2 , 



+ logdet (jjvi + piiHuKiHl^j . 
However, this implies that R2) ^ 7^" {'H,p), which clearly is a contradiction. 



C. An explicit coding scheme to achieve a smaller gap 

In this section, we propose what we call an explicit HK coding scheme that can be guaranteed to have 
a smaller gap to the capacity region than is possible with the simple HK coding scheme of the previous 
section. 

We begin with a heuristic discussion of what may be limiting the performance of the simple HK 
scheme. In particular, we argue that it is possible to choose the input covariance matrices for the private 
and public messages of each user in a rate-dependent manner so as to achieve a rate region that is larger 
than that of the simple HK scheme which is given as in Lemma [3] but without the 2"*^ and 4*'^ constraints 
(i.e., the bounds of equation (I25bl) or (I25dl )). To this end, we shall first identify the scenarios in which 
these bounds can be tighter than the corresponding bounds of equations (I25al ) or (I25cl ). Subsequently, it 
will be shown that by suitably choosing the covariance matrices for the private and public messages of 
each user, it is possible to ensure that such scenarios never arise. Thus we get a new rate region that is 
achievable by an explicit coding scheme and in which the rate pairs are constrained only by (I25at . (I25cl i 
and ( |25e| )-( [25i1) . It will also be shown that this new bigger rate region contains a subset which is within 
just n* bits (recall (|5]l) to the capacity region of the channel, thereby improving the constant gap result 
of the previous section. 
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Suppose there exists a rate pair {R\, R'2), such that i?-'s satisfy all the constraints of equation (1251) but 
(l25bl) . i.e., 

liXf ; y/ 1 Wf ,Wi) + IiWf; | X| ) < i?; < /(Xf ; | . (47) 

The maximum value of R\ G 7^h^(Ps) in such a scenario is restricted only by the bound in equation 
(I25bl ). However, comparing the two sides of equation (|47] ) we see that, the first term on the left hand side 
of equation (l47l ) differs from that on the right hand side only due to the extra in the conditioning. 
If all of the power is allocated to the private message only (i.e., X\ = Uf and Wf = (/)) then the first 
term on the left hand side alone is equal to the right hand side and equation (l47l ) can not be true. So, 
some fraction of the total power available at Txi is being used to send Wf which decreases the term 
I{Xf; Yf\Wf, )0 However, this decrease is more than the corresponding increase in the second terrr^ 
on the left hand side which in turn also suggests that the cross link from Txi to Rx2 is weaker in some 
sense than the direct link. 

Clearly, when equation (|47| | is true the rate pair {R^, R2) is not achievable by the simple HK scheme. 
The main flaw of the encoding technique in the above scenario is therefore that a significant fraction 
of the power is spent to send some common information {Wf) through a weak channel to a receiver 
{RX2) where the message is not even desirable. Intuitively it seems that, instead of wasting power on a 
weak channel, it is better if Txi chooses not to send any public information at all, i.e., set Ki^ = and 
assign all of its power to the private message. As mentioned earlier, if we put = Uf and Wf = in 
equation (07]), the strict inequality becomes an equality, i.e., 

I{Xf;Yf\Wf,Wf) + I{Wf;Yi\Xl) = I{Uf;Yf\Wf) = I{Xf;Yf\Wf), (48) 

and the two bounds in equation (I25al ) and (I25bl ) become identical. With such a power split it might turn 
out that the rate pair (i?^, R2) is actually achievable. That this is indeed the case is proved in Lemma |5] 
Example 3 (A case with no common message): Consider the 2-user Gaussian (2, 3, 2, 2) IC of Ex- 
ample 121 Computing the right hand sides of the bounds in equations (I25al) and (I25bl ) for this channel we 
get 

I{Xf,Yf\Wf, W^) + I{Wf;Y^\Xl) = 9.6572 < I{Xf,Yf\Wf) = 11.8524. 

^Because, I{Xf; Yf\W^, W2) represents the information carried by the private message of the first user, on the direct link. 
^/(Vrf; If'lJtf), which represents the information carried by only the public message of user 1 on the cross link in the 
absence of X2. 
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In Fig. |5]the dotted line represents the rate region achievable by the simple HK scheme of the previous 
section and the solid line represents the rate region achievable by the simple HK coding scheme when 
Txi uses all its power to send the private message only, i.e., Kiy^ = 0. This figure illustrates that it is 
indeed possible to achieve a rate pair outside the rate region TZ'^^{Ps). For example, on the particular 
channel of Fig. [5j point A is achievable by the coding scheme ?^/C({ jg^/jv/i, 0, K2u, K2w}) but not by 
the simple TilC^'^^ scheme. 




(in bpcu) 



Fig. 5: Comparison of the achievable rate regions of the simple HK scheme and the HK scheme with 

no public message for the first user. 



Remark 9: Fig. [5] points out another salient but important point regarding the usage of full power for 
the private message only. Note that the achievable rate region of the 'H}C{{jj^Imi,0, K2u, K2w}) (the 
region marked by the solid line in the figure) is not strictly larger than 7^hk(^s)- For instance, point 
B in Fig. [5] can not be achieved by the 'HlC{{^I]\j^,0, K2u, K2w}) scheme but it can be achieved by 
the simple HK scheme of the previous section. Thus, it is not helpful to set Wf = whenever (I25bl i is 
tighter than (|25a| ). 

The above discussion motivates a rate (and channel) dependent covariance splitting strategy for the 
private and pubhc messages of each user. Before describing it, let us define a rate region 7^2 to be a set 
of rate pairs {Ri, R2) which satisfy all but the bounds ( |25bl ) and ( |25d| ) in the achievable rate region of 
Lemma [3] 

Definition 3 (The rate region 7^2): We call the set of rate pairs (i?i,i?2) that satisfy the following 
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set of equations, TZ2'- 

Ri<I{Xl-Yf\Wi); (49a) 

i22 </(X|;y/|T^f); (49b) 

Ri + R2 <I{Xl , Wf ; y/) + I{Xf ; Yf \ Wf , ; (49c) 

Ri + R2 <I{Xf , Wl ; y/) + I {XI ; y/ 1 Wf ,W^); (49cl) 

Ri + i?2 <IiXf , Wl ; y/ \Wf) + I {XI , T^f ; y| I W|) ; (49e) 

2i?i + R2 <I{Xf , W| ; y/) + I{Xf ; y/ 1 W^f ,W^) + I{Xl , VFf ; y/ 1 T^l) ; (49f) 

+ 2i?2 </(X| , < ; y/) + I {XI ; y2^ I < ,Wi)+ I{Xf , ; Yf \Wf), (49g) 

and /(.; .|.)'s are given by equation (l27])-(l33Tl. 
Evidently, 112 = ■^hk(^s) (e.g., see Subsection llLBU). 

Definition 4 (The new explicit coding scheme (with rate dependent covariance split)): Consider a 
modification of the simple HK coding scheme of Definition [2] where each of the transmitters chooses its 
covariance split between its private and public messages depending on the rate pair to be achieved in the 
following manner: 

1) (i?i, R2) G 7^2 but Ri's violate constraint (I25bl ): Txi assigns all its available energy to its private 
message only, i.e., the coding scheme 'HK,{{jj^Imi,^, K2u, K2w}) — TilC^'^^^ is used. 

2) {Ri, R2) G 7^2 but Ri's violate constraint (|25d| ): Tx2 assigns all its available energy to its private 
message only, i.e., the coding scheme 'HK,{{Kiu,Kiu,, 75^-^1/2) 0}) = 'HIC^'^^^ is used. 

3) (i?i,i?2) G T^'ukiPs), i-^-, Ri's violate neither (I25bl ) nor (I25dr l: the simple HK coding scheme 



'HlC{{Kiu,Kiw,K2u,K2w}) is used, where Kiu and Kiyj are chosen according equation (|23T l and 
dH for both z = 1,2. 

An HK coding scheme which uses mutually independent Gaussian codewords to encode the private 
{Ui) and public {Wi) messages of each user, where the covariance matrices for the different messages 
are chosen as described above, will be referred to as the explicit HK coding scheme and will be denoted 
by UK,. 

Lemma 5: The explicit HK coding scheme T-LK, employed on XC{T-L, p) has an achievable rate region 
that contains the rate region 1Z2- 

''it will be shown in the proof of Lemma |5] that there does not exist any rate pair (Ri, -R2) G 7?,2 which violates both J25bt 
and J25dt simultaneously. 
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Proof outline: The detailed proof is given in Appendix ID] which makes precise the heuristic argument 
in the beginning of this section that lead to the definition of the explicit HK scheme of Definition |4l We 
provide here an outline. 

We know that every point in TZ'^^{Ps) is achievable by the T-LK}-^^ scheme. Using this result we prove 
the lemma by showing that every rate pair that lies in 7^2 but not in Tl'^^iPs) can also be achieved by 
the explicit coding scheme T-LK, of Definition ID 

It can be easily seen that (see Appendix iDl). when (i?i,i?2) G Tli but {Ri,R-2) ^ TZ'^^{Ps) both of 
(|25a| ) and (|25d| ) can not be violated simultaneously because this would imply that {Ri + R2) violates 
equation ( |49e| ). thereby contradicting the assumption that {Ri,R2) G 7^2- On the other hand it can be 
shown that when (I25bl ) is violated for a given {Ri,R2) G 7^2> the ^{^.{{^ImijO, K2u, K2w}) scheme 
- in which user one assigns all its power to the private message - can achieve the rate pair. When (|25dl i 
is violated, the ?^/C({Kitt, Ki^^,, -^/a/^, 0}) scheme can achieve the the corresponding rate pair. This 
however is exactly the coding scheme in Definition ID Therefore, the UK. scheme can achieve any rate 
point in 7^2- ■ 

It is clear that, depending on the rate pair to be achieved, the explicit HK scheme uses one of the three 
simple HK coding schemes specified in Definition [H The corresponding input distribution when Txi 
spends all its power to send the private message only, was denoted by Pg^ (e-g-, see Appendix ID)) in the 
proof of LemmalU where Pg^ (.) G V*, for i = 1,2. The achievable rate region of the simple HK scheme 
with input distribution Pg^ is given by Tl^^iPsJ, where Tl^^{PsJ can be computed as in Lemma [3l As 
was argued earlier, to achieve a point in this rate region, it is important to choose the sub-rates carefully. 
In particular, to achieve any rate pair (i?i,i?2) G TZ^^{Ps,), the corresponding sub-rates for the private 
and public messages can be chosen from 7^^^(Ps. ) since 7^hk(^s.) ~ ^ (^^hk(-^s. )) LemmaO This 
suggest the following rate splitting strategy for the explicit HK scheme, T-LIC. 

Rate splitting strategy for the explicit HK coding scheme: Depending on the rate pair to be achieved 
when the input distribution of the coding is P, where P G {P^, , P^^}, the sub-rates for the different 
private and public messages are chosen from TZ^^{P), where 7^hk(-^) computed from equation 

(l26l) by using distribution P in place of Pg. 

Remark 10: Since the sub-rates are chosen from Tl^^{P), when Txi spends all its power to send the 
private message only, i.e., W[ = 4>, it is expected that 7^hk(^) should not allow any positive rate for 
the common message. Putting Wf = (j) in equation (I26bl ) it can be easily seen that it is indeed the case, 
i.e., Ti < 0. 

Remark 11: It is worth pointing out the differences between the explicit coding scheme of this 
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paper specialized to the SISO IC and that in ||2TI . where a simple coding scheme was also suggested 
to characterize the capacity region of the SISO IC within one bit. The authors in iBTl use a linear 
superposition coding scheme where each users private and public messages are encoded using independent 
Gaussian random codewords with powers Piu and {Pi — Piu), respectively for i = 1,2. Here, Pi is the 
total average power of Txi and Piu depends on the cross channel coefficients as follows (see equation 
(57) and (58) of EH) 

Piu = min{Pi, ^^}, i^je {1, 2}. (50) 



In the notation of the present paper this coding scheme is identical to 'HIC{{^, ^1 — -^j , 

1 - ^^}), when Pi > -p^}- On ^e other hand, it is identical to nJC{{l,0, (^1 - -^)}), 
when only Pi < pj^}, TiJC{{^, ^1 - , 1,0}), when only P2 < jj^^} and it is identical to 
T^/Cdl, 0, 1, 0}), when Pi < pj^jp} for both i = 1,2. So, depending on the channel coefficients the 
coding scheme is equivalent to one of the four schemes just described. However, for a given channel 
the coding scheme and power allocation of lIlTI is fixed and does not change with the rate pair to be 
achieved. However, the explicit coding scheme of this paper utilizes one of the three different power 
splitting schemes depending on the rate pair to be achieved. 

Moreover, in contrast to [21], the description of the T-LK. scheme here explicitly specifies the sub-rates 
of the different messages for each rate pair to be achieved in 7^2- 

Remark 12: There is also a subtle difference between the T-LIC scheme and a coding scheme which 
time shares between the three simple HK schemes of Definition |4l In general, the latter can achieve a 
larger rate region. This is the case because the HIC scheme does not use 7ilC{{^lMi , 0, K2u, K2w}) or 
%lC{{Kiu, Kiw, jf^hhjO}) to achieve any point that is not inside 7^2- Whereas, both of 7i)C{{^lMi,0, 
K2U, K2w}) and ^{^.{{Kiu, Ki^, ■^Im2, 0}) may achieve points which do not lie in 7^2 but are achiev- 
able by a time sharing scheme. Fig. |6] illustrates this point through an example. In particular. Fig. 6(a) 



depicts the achievable rate regions of the three simple HK schemes for the channel of Example |2l In Fig. 
|6(b)| the rate region bounded by the dotted-dashed line represents the achievable region 7^2 of the explicit 
HK scheme and the dashed line represents the rate region, 7^t5> achievable by time sharing. Point A in 
the latter represents a rate pair which lies in the achievable region of T-LICdjj^lMijO, K2u, K2w}) and 
hence also lies in TZts but it is clearly outside 7^2 and hence not achievable by T-LJC. There are of course 
rate pairs in TZts that don't even lie in the union of the three regions of Fig. |6(a) as seen in Fig. |6(b)| 
The advantage of T-LJC however is of course that it does not use time-sharing and moreover, it is not clear 
that the time-sharing scheme here necessarily leads to improved performance in terms of guaranteeing a 
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smaller gap to the capacity region. 




(in Bpcu) (in Bpcu) 

(a) Achievable rate regions of the component schemes, (b) Rate regions of the explicit scheme and time sharing. 



Fig. 6: Comparison of the achievable rate regions of the explicit scheme and the region achievable by 
time sharing among the component schemes, on the channel of Ex. |2l 



Remark 13: Recently in |[29l . an alternative proof of Theorem 2 of |fl4| was given (e.g., see Remark[6]|. 
The distinguishing aspect of this result from that in iPHI is that, the rate region Tl\n{Pi) for each is 
achievable by a single input distribution. It might therefore might appear that 7^hk(^i ) achievable by 
a single input distribution and therefore 7^-2 is also achievable by a single input distribution. However, 
as explained in Remark |6l the expression for Tlin{P*) has two extra constraints than those that define 
^HK(-fr)- other words, TZ\n{Pi) may not be equal to TZ^^{Pi) for all inputs of the form P^. Therefore, 
the alternative proof of ||29l does not suggest that 7^2 is achievable by a single input distribution. 

From the fact that the rate region 7^2 is larger than Tl^^{Ps), it might be possible to identify a subset of 
7^2 whose boundary is also at most a constant number of bits from the set of upper bounds in Lemma [1] 
The interesting point here is that this constant can now be smaller than n/. Indeed, the following lemma 
provides such a subset of rate pairs. 

Lemma 6: Let Tl*{'H,p) be a set of non-negative rate pairs (iii,i?2) which satisfy the following 
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constraints: 



< ( log det (^In, + piiHiiH\^ -n\^ , 
R2 < (^logdet(^lN, + P22H22HI2) -n*2 
Ri + R2< (logdet(/7V, + P12H12HI2 + P22H22HI2 



+ log det (/jvi + puHuKiHI^^ - {nl + n^) 
Ri+R2< (log det + P2iH2iHI^ + piiHiiH\^ 

+ log det (In, + P22H22K2HI^ - {n[ + nl) 
Ri+R2< (log det (/^, + P2iH2iHI^ + puHuKiHI^ 

+ logdet(/7v, + PuHuHI^ + P22H22K2HI^^ - {n\ + nl) 
2Ri +R2< (log det (% + p2i^2i^li + PuHuHI^ + log det (l^v, + piiHuK^H[^ 
+ log det (/^, + P12H12HI2 + P22H22K2HI2) - {2nl + nl) 



Ri + 2R2 < (logdet(/Ar^ + P12H12HI2 + P22H22HI2) + log det (In^ + P22H22K2H} 



22 



+ logdet(^/7v, + P2i^2i^2i + PiiHuKiHl^j - {nl + 2n^ 

where n* as defined in Then, TZ*^{T-L,p) is an achievable rate region on IC{'H,p) and is achievable 
by the explicit HK coding scheme T-LJC, i.e., 

Proof: It is sufficient to prove that the set of bounds on the different linear combinations of the rate 
tuples {Ri, R2) in the above lemma are stricter than those given in the expression for 7^2> i-c, equations 
(|49l ). Now, on one hand, the bounds of equation (|49l ) are same as those in ( |25a| ). (|25c| ) and (|25e| ) - (|25i| ). 
On the other hand, in the proof of Lemma |4] given in Appendix |C] a stricter set of bounds than those in 
equation (I25al )- (l25il ) was computed (see equations (l68l ) in Appendix 0. Clearly, equations in (I68al l. (I68cl ) 
and ( |68e| )- (|68i| ) represent a set of stricter bounds than those in ( |49l ). The set of rate tuples defined by 
(|68aK (|68c| ) and ( |68e| )-( [68ll ) is a subset of 7^2- The Lemma then follows from the fact that these bounds 
are exactly same as those in the lemma. ■ 
Theorem 3: The achievable rate region TZ* {T-L^p), given by Lemma [6l is within n* bits to the capacity 
region of the Gaussian MIMO IC, where n* is given by equation (jSj). 

Proof: The proof is identical to that of Theorem |2] ■ 
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Corollary 1: The achievable rate region 1Z*^{T-L,p) is within n* bits to Tl'^{'H,p). 

Proof: Follows from the proof of Theorem |2] with TZaCH, p) replace by 7^* ("H, p) and by n*. ■ 

Example 4: Capacity of the SIMO IC within 1 bit: On a (l,iVi, l,iV2) IC, n\ = n*^ = 1, thus the 
explicit T-LK, scheme can achieve a rate region which is within 1 bit of the capacity region for any SNRs, 
INRs and the channel vectors. This result is different from that reported in [23] where the exact sum 
capacity of the strong SIMO IC with lli/jip < ||-f^tj|P for 1 < f / j < 2, was characterized. While [23] 
provides the exact sum capacity for the strong SIMO IC, our 1 bit approximation is valid for all channel 
coefficients. Further, this approximation is tighter than that reported in ll24l and ||25| . where the capacity 
approximation within Ni bits was proved. 

Remark 14: Although the above approximate characterization is not always better than that reported 
in ll24l . it provides a better approximation for a large class of interference channels. In particular, for all 
the interference channels on which n* < Ni the explicit T-LJC scheme provides a tighter approximation. 
Among the other interesting aspects of the approximate characterization of this section are (a) we have 
a set of explicit expressions for the achievable region and upper bounds to the capacity region, which 
for instance, can be used for a further analysis such as the evaluation of the generalized degrees of 
freedom region (which is reported in the companion paper [|26l ) and the diversity-multiplexing tradeoff 
(DMT) analysis obtained by the authors in ||3TI and |[32l : and (b) an explicit coding scheme, involving 
just three linear superposition strategies was shown to be approximate capacity optimal in contrast to the 
result of ll24l where no light is shed on what simple or explicit scheme, if any, out of all possible input 
distributions and all possible time sharing schemes, would be approximate capacity optimal. 

D. Reciprocity of the approximate capacity region 

For a communication channel with an unequal number of antennas at the source and destination nodes, 
how does the capacity (or any other performance metric) change if the information flows in the opposite 
direction (i.e., the roles of the transmitters and the receivers are interchanged)? The property of maintaining 
the same performance even if the direction of flow of information is reversed is widely known as the 
reciprocity of the channel. For instance, the following reciprocity of the point-to-point MIMO channel 
was proved in ||33l : the capacity of a MIMO point-to-point channel is unchanged when the roles of the 
transmitters and receivers are interchanged provided the power constraint is appropriately scaled. In Il34l 
the degrees of freedom (DoF) region of a (Mi, A^i, M2, N2) MIMO IC was shown to be the same as that 
of a (TVi, Ml, iV2, M2) IC. In this section, we prove a reciprocity result for the (Mi, iVi, M2, iV2) MIMO 
IC by showing that reciprocity actually holds in the much stronger constant-gap-to-capacity sense. 



January 11, 2013 



DRAFT 



32 




(a) Information flowing in the reverse direction in the (b) Equivalent channel, information flows in the forward 
(Ml , iVi , K'h , iV2 ) IC. direction. 

Fig. 7: Infomation flowing in the reverse direction on an 2-user MIMO IC and its corresponding 

forward information flow model. 



Fig. |7(a)| illustrates an (Mi, A^i, M2, MIMO IC with channel parameters H and p with roles of 
the transmitters and receivers interchanged so that information flows in the reverse direction. Fig. |7(b)| 
shows its equivalent model where the information flows in the forward direction. Clearly, the capacity 
of the reverse channel is the same as that of ZC (T-L^ , ff) where T-L^ = {Hf^, HJi, Hf2, H22} and p"^ = 
[/O, P2I1 Pi2) /022]- The capacity region of the reverse channel is denoted as C {W ,p^'). 

Let us define the counterparts in the reverse channel of the capacity gap parameters of the forward 
channel in ^ as 

m* ^ mm{Mi,Ns} \og{N^) + rfnj, 1 < i / j < 2, (51) 

where Ns = {Ni+N2), = max{iVi, Af2} and ifnj = niij log for 1 < i / j < 2. 

To prove the reciprocity in the constant gap to capacity sense, the capacity regions of IC {T-L-,p) and 
XC {T-L^, p^) must be shown to be within a constant number of bits to each other. We start with a result 
on the outer bounds. 

Lemma 7: The outer bound IZ^ {T~L,p) from Lemma [T] of the forward channel XC {Ti, p) and the outer 
bound 7^" {W ,p'') (obtained in the same way as in Lemma [T] but for the reverse channel XC (T-L^^ff)) 
define the same set of rate pairs, i.e., 

TV'{U,p) = lV{U\f). (52) 
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Proof: The proof is given in Appendix 10 ■ 
Corollary [1] proves that the explicit HK scheme, T-LK., achieves a rate region on XC{'H,p) which is 
within n* bits to a set of rate pair Tl^{'H,p) which contains its capacity region. Clearly, the counterpart 
of this explicit HK coding scheme for the reverse channel (with suitable changes in the channel matrices, 
INRs and the number of antennas) can achieve a rate region on ZC {W ,p^) which is within m* bits to 
7^" (T-L^jff), where m* is given by equation (|5T]) . However, from Lemma [T] we know that 

Thus the capacity regions of the two interference channels can not differ by more than max{m*,n*} 
bits proving the following theorem. 

Theorem 4: The capacity regions of IC{'H,p) and IC{H^ , ff) are within max{m|,n*} bits to each 
other, i.e., if (iii,i?2) G C{T-L,p), then there exists a rate pair {R\,R2) G C{W ^p^), the capacity region 
of the reverse channel, such that 

\{Ri - Rl)\ < max{m*,n*}, V 1 < i < 2. 

Proof: Let {Ri, R2) e C{n, p). From Corollary [U there exist a rate pair (^1, ^2) G Tl''{Ti,p) such 

that 

0<{Ri- Ri) <n*, V 1 < i < 2. (53) 

Further, from Lemma |7]we have {Ri, R2) € IZ^ {W, p') . Next, applying Corollary [T] for ICCH^ , /f), we 
have that there exists a rate pair {R[,R2) G C{W^p^) such that 

< {Ri - RI) < m*, V 1 < i < 2. (54) 

Note that equations ( |53] ) and (|54l ) provide ranges of Ri and i?[ and the magnitude of the difference 
between them is maximum when one takes its largest value and the other its smallest, i.e., 

\{Ri - Rl)\max = m* or n*, 

which proves the theorem. ■ 
Remark 15: Note that reciprocity holds for the 2-user MIMO IC without power scaling and this 
may seem counter-intuitive given the point-to-point MIMO channel result of ||33l . The reason is that 
reciprocity was shown here in the approximate capacity sense. The difference due to not scaling power 
gets absorbed in the gap that already exists between the exact capacity and the achievable region. 
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IV. Conclusion 



An approximate capacity region of the 2-user MIMO IC with an arbitrary number of antennas at each 
node is characterized. It is shown that a simple and an explicit HK coding schemes which can be seen to 
inherently perform a form of joint interference alignment in the signal space and in the signal level (see 
Section II.C of the companion paper ll26l for this interpretation) can achieve the capacity region within 
a constant gap. For a class of ICs, this gap is the tightest approximation to the capacity region of the 
MIMO IC found to date and this includes the SIMO ICs for which the gap is 1 bit independently of the 
number of antennas at the receivers. The explicit upper and lower bounds to the capacity region are used 
to prove the reciprocity of the MIMO IC in the constant-gap-to-capacity sense. 



Proof: The set of upper bounds to the achievable rate region will be derived in two steps. In 
the first step, the different mutual information terms in Fano's inequality are expanded in terms of the 
corresponding differential entropies. The genie-aided signaling strategies of ll24l are employed albeit with 
different side information (see Remark [1} to the receivers so that no negative entropy term involving inputs 
appear in the upper bound, thereby allowing for a single-letterization of the resulting bounds. The positive 
differential entropies are then upper bounded using Lemma |8] and its corollaries proved in Appendix |B] 
1) From Fano's lemma we have 



<I{X^;Y{^,X^) + nen, 
=I{X^; X^) + I{X^; YI'IX^) + nen, [/(X^"; X^) = 0, since Xu and X2t are independent] 
=/i( Vm(^ ® Hu)Xi + ZD - h{Z^) + nen 
<nlogdet [In, + piii?iii?Jj - logdet(/ArJ + nen, 



=nlogdet \^In, + puHuHl^j + ne„, 

where step (a) follows from the fact that the extra information (^2 ) at the receiver does not reduce 
mutual information and step (6) follows from Corollary 5] which in turn is proved in Appendix El 
Now dividing both sides by n and taking the limit as n — oo, we have 



Appendix A 



Proof of Lemma [T] 



nRi <I{X^;Y;') +ne., 

(a) 



-n 
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Fig. 8: 2-user MIMO IC with genie aided receivers. 



2) The second bound can be obtained similarly. 

3) The third upper bound is derived with the genie providing the side information (S'f,X2) to Rxi 
where Su is the interference plus noise at Rxj j ^ i (as shown in Fig. [8]l. Since additional 
information does not reduce mutual information we have from Fano's lemma 

n{Ri + R2) <I{X^; + ^(^2 ; ^2") + nen 

< /(Xf; Y^, S^,X^) + I{X^;Y^) + n6„, 

Again J(Xf;XJ) = 0, so that 

n(i?i + R2) </i(5i"|X2") - h{Z^) + hmS'^,X^) - hiZ'^) + h{Y^^) - h{Y^^\X^) + ne„, 
=/i(5i") - h{Z^) + h{Y,^\S^,X^) - h{Z^) + /i(y2") - + ?^en, 

=h{^iln (S) Hu)X^ + ^r|Si") + - n(iVi + iV2) log(27re) + ne„, 

<nlogdet (^In, + Pii-f/'ii (^^A/i + PuHI^H^^ + ne„+ 

nlogdet (In2 + PuHuHI^ + P22H22HI2) , 



where (6) follows from Corollaries |2] and |3] in Appendix |B] Now, dividing both sides by n and 
taking the limit as n — 00, we get the third bound since e„ — as n — 00. 
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4) The fourth bound can be obtained similarly assuming the genie provides the side information 



5) To derive the fifth upper bound we assume that the genie gives side information to Rxi. Using 
Fano's lemma we have 



n{Ri + R2) <I{X^; n") + HX^; + nen, 

<I{X^; 5?) + /(X2"; Y,^, S^) + nen, 

=I{X^; 5i") + I{X^-, Y,^\S^) + I{X^; S^) + I{X^; Y,"\S^) + nen, 
=/i(5i") + h{Y{'\S'^) - h{Y{''\S'^,X'^) + h{S^) + HYi'lS^) 

- h{Z^) - h{Z^) - h{Y^\S^, X^) + ne„, 

- h{Z^) - h{Z^) + nen, 

=h{Y{'\S'^) + hiY^^'lS^) - n{Ni + N2) log(27re) 

+ nen, ['■' Sl^is independent of 5", for i / j] 
<nlogdet (in^ + P2iH2iHI^ + puHu [hh + PuHI^Hu) ^ hI^\ + 



n log det I IN2 + P12H12HI2 + P22H22 ( IM2 + P2iHl^H2i ) + ne, 



n(2i?i + R2) <I{X^; n") + I{X'^;Y^, ) + I{X1^; Y^) + nen, 

<I{X^; Y^) + I{X'^;Y^, S^, X^) + 1{XI, Y^ , S^) + ne, 



where the last step again follows from the fact that extra information does not reduce mutual 



(5J,Xf) to RX2. 




where in step (d) we used Lemma [8] of Appendix |b1 twice. 



6) Again from Fano's lemma we have 
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infomiation. Next, using the chain rule of mutual infomiation we get 

n{2Ri + R2) <I{X^; YD + /(Xf; ^^1^2) + ^^"l^r, X^) + I{X^; S'i) + Y^\S^) + ne^, 

=/i(yi") - KY^X"^) + h(S'l\X^) - h[Sl\X^,X'^) + h[YDSl,XD- 

h{YDSl,X^,X^) + h{S^) - h{S'i\X^) + KY:^\S^) - h{Y^\S^,X^) + n6„, 
=h{YD - h{S^) + /i(5n - h{Z^) + h{{In ® Hu)X^ + ^r|Si") - h{Z^)+ 

h{S^) - h{Z^) + h{Y^\S^) - hiS'^lS^) + nen, 
=h{YD + h{^{In ® Hu)X^ + Zi"|5n + HY^^IS^) - n{2Ni + N2) log(27re) + ne„, 
<nlogdet (^In, + P2i^2ii^li + PuHuHI-^^ + 

n log det (^Mt + Pii-H'ii (yhh + PuHI^H^^ hI-^ + 

n log det ^In2 + PuH^hI^ + P22H22 {ihh + P2iH\^H2^ + ncn, 

where in step (e) we used Corollary |3] Corollary |2] and Lemma |8] of Appendix El Finally, dividing 
both sides by n and taking the limit with respect to n, we get equation (fT2l ). 
7) The 7*'* bound can be similarly derived as the last one. 



Appendix B 
Proof of Lemma [8] 

From equation (|2) we know that Su = y/pijHijXu + Zj represents the interference from user i plus 
the additive noise at receiver j at time t and 5" = [Sj-^ " " " ^in\ ^ '^^^ written as 






S? 



H^j 



K 









Xii 




Zil 




Xi2 




Zi2 






+ 






Xin 




Zin 



p-]iIn^H,,)X^ + Z^. 



Similarly, the output at Rxi over n channel uses can be written as 

= VP^iiln ® Hu)Xr + ^(/„ » Hji)X^ + Z^, for i / j G {1, 2}, 

where Xu G C^^'^^, \/ t < n satisfies the power constraints of equation (H). 
Lemma 8: For the n-length vector sequences Y-"' and Sf as described above 

hiYi"\S^) <n log det (in^ + Pj^HJ,Hl + puHu [Im. + Pi^H^H^j) ~' hJ^ + niV, log(27re) , 
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fori/jG{l,2}. 

Proof: We shall prove the Lemma for z = 1 with i = 2 case being identical. Denoting the covariance 
matrix of a zero-mean random vector V by Cov{V), i.e., Cov{V) = E(yyt) and the composite vector 
14 by 

\fPl2Hi2Xit + Z2t 
/PllHuXit + JJ)2lH2lX2t + Zit 



Vt 



Sit 




_ Yu _ 





V t < n, 



(55) 



it can be easily verified that 



E ( VtV^^ 



Pl2Hl2QltH\2 + In^ ^ H12\J yil^^ V2^1t^-^ll 

\fPll\fPv2HllQltH\2 PiiHiiQuHIt^ + P2lH2lQ2tH2i + Ini 



^y/PuHuQitHl 



y t<n. 
(56) 



Note that in the above computation we assumed that input distribution has zero mean, which is standard 
since a non-zero mean only contributes to power inefficiency. Let us define 

5*1 =^/pV2Hl2X'^ + Z21 

y; =^iHiiXf + ^iH2iX^ + Zi, 

where Xf ~ C7V(0, ^ XlILi Qit) for 1 < i / j < 2 and Xf and Xi^ are mutually independent. It 



can be easily verified that S^ and 



5r 



are Gaussian vectors with covariance matrices K and Kj, 



respectively, where K = puHuQiHl^ + J^y^, = 1 Ya=i Qa and 



Kj 



fWi2JpiiHi2QiH{^ 



P12H12Q1HI2 + IN2 
y/pEy/p^HuQiHl^ puHuQiHI^ + P21H21Q2HI1 + IN^ 



(57) 



which follows from the fact that any linear transformation of a Gaussian vector is also Gaussian (Propo- 
sition 5.2, 1351 ) and the sum of several mutually independent Gaussian vectors is also Gaussian. In other 
words, 



Cov 



SI 
Y* 



1 " 

-YCov 



t=\ 



Sit 
Yit 



Under the constraints satisfied on 5[, Y* and Vt, 1 < i < n, it was proved in Lemma 2 of ll20l that 

/i(yi"|5n <nh{Y{\Sl) 

S^ 



-n h 




h{Sl) 



-n 



(logdet(i?j)-logdet(^)) 
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Putting the values of Kj and K in the above equation and after some simpUfication, we get 

h{Y^\S'i) <nlogdet (In, + PuHuQiHI, + P21H21Q2HI- 



piipuHuQiHl^ [In, + PuHuQiHI^) ^ HuQiHI^^ + nNi log(27re), 

^=^nlogdet (^In, + P21H21Q2HI + puHnQf (Ia/, + puQf hI^HuQI) ' QfHl 

+ nNi log(27re), 
<nlogdet (^In, + P2iH2iHI^ + puHn (^hh + PuHI^H^^ ^ 
+ niVilog(27re), [.-Qi ^ Im^ 

where step (c) follows from the Woodbury identity and the last step follows from the fact that logdet(.) 
is a monotonically increasing function on the cone of p.d. matrices and Lemma |9] (below) with Gi = Ql, 
A = h\^Hi2 and G2 = hi,- ■ 
Lemma 9: Let ^ Gi ^ G2 and ^ ^4 are p.s.d. matrices of size n, then for any given vr G M+ 

Gi (/ + -kGiAGiY^ Gi ^ G2 (/ + vrGz^Ga)"^ G2. 

Proof: Let e G Gie = (Gi + el) and G2e = (G2 + el). For any such e, we have 

G2e >i Gi, ^ 0, or 
Gr/>=G2-V0, or 
(G^/ + >i (G2,^ + vrA) ^ 0, or 
(Gr/ + ^^)"'^(G2,2 + vrA)"\ or 
Gi, (/ + vrGi.AGi,)-^ Gi, ^ G2. (/ + TrGa.AGs.)-' G2.. 
From the definition of partial order between p.s.d. matrices we get 

X (Gi, (/ + vrGi.AGi,)-^ Gi,) < x (G2, (/ + ^G2eAG2e)"^ G2e) xt, V X G C^^"; 

lim X f Gi, (/ + t:Gi,AGu)'^ Gi)\ x^ < lim x f G2, (/ + 7rG2,^G2e)"^ G2,') x^ V x G C^'^", 

where the last step follows from the fact that for any Gi, A and vr as defined above and for any sequence 
of positive real numbers {en]^=i with e — )■ as — )• 00 we have 

Jim (g,,„ (/ + vrGi,, AG«J"' G,,„) = (Ci {I + ttG.AG,)'^ G^j . 
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Substituting their limits in the last equation we get 

X (Gi {I + vrGiAGi)-^ Gi) < x (G2 {I + ^G2^G'2)"^ G2) x\ y x e C^""". 



Invoking the definition of partial ordering once again, the lemma is proved. ■ 
The following corollaries can be proved from Lemma |8] In particular, Corollary |2]by setting Hji = 0, 

Corollary |3] by setting Hij = and Corollary |4] by putting Hji = and Hij = 0. 
Corollary 2: 

J ""it I ' 

+ niVaog(27re), {1,2}. 



h{^i{In ® Hu)X^ + I Sf) <n log det ( In. + puHa (Im, + PijHj H.j ) Hj 



Corollary 3: 

< n log det (^In. + Pj^HJiH]■ + PnH^^H}^ + niV, log(27re), i / j G {1, 2}. 

Corollary 4: 

K^i{In ® Hii)X2 + < nlogdet (/tv. + PuHuH]^ + niVaog(27re), i / j G {1,2}. 

Appendix C 
Proof of Lemma |4] 

As stated in the outline, it is only required to show that the right hand sides of the different bounds 



in the lemma are actually lower bounds to the corresponding terms in (I25al )- (l25il ). However, first we 
derive some common inequalities which will be used throughout the proof. From the definition of p.s.d. 
matrices ||36l we get 

(p,,i?,,i^»4) = (^^H,, {Im. + p^,HlH,,) 4) ^0, -.- (Jm. + PrMjH^,) >~ (58) 
On the other hand, for any given 7^ x G C^^^^ we have 

1 / ^/ -TT XX'^ 

where Hij G C^^^*^' is the singular value matrix of Hij, i.e., Hij = UijT,ijV^- and step (a) follows 

'1 



from the fact that Pij'^ij ylRU + Pij^lj^ijj ^Ij ^ -^A^j- However, the last inequality along with the 
definition of p.s.d. matrices imply 

PijH^jKijHl = (^^Hij {Im.+p,,h\^H,^~' hI)j ^^Jn,- (59) 
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Each of the eigenvalues of the matrix on the left hand side of equation (1591 ) is smaller than or equal to 1 . 
However, that matrix has only mm{Mi, Nj} = rriij non-zero eigenvalues since the rank of Hij is rriij, 
which in turn implies that 

logdet (^In, + PijHijKiuH]^ < rriij log = "^^i' ^^^^ 

for all 1 < i / j < 2. 

As a first step towards deriving the lower bounds, in what follows, we shall first derive lower bounds 
for the different mutual information terms of equation (I27i -(l33]). From equation (|27] ) we obtain 



log det ( P2iH2iK2uHI^ + In^ 



m2i, 



>logdet(gii/ni^ii^U^/..) 

=logdet [piiHiiKiHl^ + In,) - (mn log(Mi) + mai), (61) 

where step (a) follows from the fact that logdet(.) is a monotonically increasing function over the cone 
of positive-definite matiices with respect to the partial ordering and equations dSSl ) and (|60l ). The last 
equality follows from the fact that Hn is a rank nin matrix. Similarly, 

Yi\Wl Wf) >log det [p22H22K2Hl^ + In,) - (mas log(M2) + mis). (62) 

From equation (|29l ) we obtain 

I{WI; Yf\Xl) =log det (^^^H2iHI + In,^ - log det [p2iH2iK2uHl^ + In) , 

>logdet(^gli/2ii?li + /^,) -msi, [■.■ m] 

=logdet {p2iH2iHI^ + In) - (msi log(M2) + mai), (63) 



where the last step follows from the fact that the rank of H21 is 11121. Similarly, we have 

I{Wf- y/|Xf ) >log det (In, + pi2Hi2H\^ - (mi2 log(Mi) + mia). (64) 
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From equation (l30l l 

I{Xf;Yf\W^) =logdet (g-i^ni^li + P21H21K2UHI, + /^v,) - logdet (p2i^2ii^2«^li + /jv,) , 
Mogdet ("glifiiFji + lN?j - rfi2i > logdet (^^HiiH\^ + ^^A^:) " "^21, 



=logdet i^piiHuHl^ + /jvi j - (mn log(Mi) + mai), (65) 

where steps (6) follows from equations (|58] ) and (|60] l and the fact that logdet(.) is a monotonically 
increasing function over the cone of positive-definite matrices. The last equality follows from the fact 
that Hii is a rank inn matrix. Similarly, we have 

I{Xl;Yi\Wf) > log det (^22^^22^12 + In,) - (m22 log(M2) + mis). (66) 
Next, we focus on the term Wf; Y^). 

I{Xl Wf- Yi) =logdet (^^H^2H\^ + g|^22i^2^2 + In^I - logdet (^pi2Hi2K,^Hl^ + In, 
=logdet (puHuHI^ + P22H22HI2 + In,) - (min{iV2, MJ log(M^.) + mia), 



where the last step follows from the fact that the matrix ^|^i?i2-ff|2 + ir"-^22-f^22) tank mm{N2, Mg}, 
Mg and are as defined in Section JII] Similarly, we have 



I{Xf, Wi; Yf) >log det (p2ii^2i^li + PuHuHI + In,)- (min{iVi, MJ log(M,) + m2i). 
From equation (OTT l 

IiXf,Wi;Yf\Wf) =logdet (^puHuKiuHl + ^H2iHI + 

- logdet ip2iIl2iK2uHl^ + /aTi 



>logdet (^^FnA'ii^li + J^^H2iH^i + In, ] - mai, [•.• S 
>logdet (fl^ni^ii^Ii + f|//2ii/2\ + ) - n.21, 

(a) 



logdet (^puHuKiHI^ + p2ii^2i^^2i + -^JVi' 

- (min{iVi, AfJ log(A4) + m2i) = /f, , 
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where step (a) follows from the fact that the sum of the first two matrices inside logdet(.) has a rank 
of mm{Ni, Ms}, and similarly, 

I{Xl, Wf; Yi\Wl) >logdet (/,v. + 9x2^x2^X2 + P22^22i^2i^l2^ 

- (min{iV2, MJ log(M^.) + 7x1x2). (67) 

Note that each of the lower bounds are difference of a channel dependent term and a constant. To denote 
the constants by a common notation we define n* = min{A^j, M^} log(M2;) + rhji, for z / j G {l7 2}. 
Using this notations and equations ([6T])-(|67]) we get the following set of stricter bounds]^ 



Rx < 
Ri < 

R2 < 
R2 < 



Ri+R2< 



Ri + R2< 



Ri+R2< 



+ 



2Ri +R2< 



+ 



Ri + 2R2 < 



log det 
log det 

log det 
log det 

log det 

log det 

log det 
log det 
log det 
log det 
log det 
+ log det 



In, + piiHuHlj -nl] ; (68a) 

Ia/i + puHI^Hu + 1) - (mil log(Mi) + mi2 log(Mi + 1)) - m2i 

(68b) 

Itv, + P22H22HI2) -nlY; (68c) 

hi, + P22HI2H22 + K^^) - {m22 log(M2) + m2i log(M2 + 1)) - mi2^ 

(68d) 

+ P12H12H12 + P22H22H22) 
+ log det (In, + piiHiiKiHl^ - {n\ + nl) 

In, + P2iH2iHI^ + piiHuHl^'^ 
+ log det (In, + P22H22K2HI^ -{n\ + n*2)] ; (68f) 



(68g) 



(68e) 



In, + P2iH2iHI^ + piiHuKiHl^ 



In, + P12H12HI2 + p22ll22K2Hl^j - {n\ + nl) 
In, + P2iH2iHI^ + piiHiiH\^ + log det (In, + piiHiiKiH\^ 
In, + P12H12HI2 + P22H22K2HI^ - {2n\ + n^))^; 
In, + Pi2-H'i2-H'i2 + P22II22II22) + log det (^Im, + P22II22K2H22) 
In, + ^21^21^21 + Pii^^iii^i^ii) - {nl + 2nl 



(68h) 



(681) 



It is both necessary and sufficient to keep the non-negative parts of the lower bounds because Ri's can not be negative and 
the rate pair (0, 0) is trivially achievable, respectively. 
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Since the right hand sides of each of the bounds in equation (l68l ) is smaller than those in the corresponding 
bound in equation (1251 ). the set of rate tuples defined by the above set of bounds is a subset of TZ^^{Ps). 
However, in equation (|68]| there are two bounds on each Ri. To combine them into one we define 

rii = max{(mjj log(Mj) + ruij log(Mj + 1)) + mji,n*}, V i 7^ j G {1, 2}. 

Using this notation the first four bounds on Ri and R2 in equation (l68l) can be replaced by following 
two further stricter bounds. 

Ri < (^logdet (^In, + piii/iii/Ji) -ni)^; 

i?2 < ( log det (^/at, + P22H22HI-^ -n2^ , 

where we have used the fact that Kf^ is a p.d. matrix and logdet(.) is a monotonically increasing 
function in the cone of p.s.d. matrices. Substituting the above two equations in place of the first four in 
( l68l) we get the bounds of the Lemma since Ui > n*, for all i G {1, 2}. 

Appendix D 
Proof of Lemma [5] 

As stated in the outline, first we show that when (i?i,i?,2) G 7^2> equations ( |25b| ) and ( |25d|) both can 
not be violated simultaneously. Suppose, it is not true, i.e., (i?i,i?2) G ^2 but 

R, >IiXf;Yf\Wf,Wi) + I{Wf;Yi\Xl)- 

R2 >/(Xf ; Yi\Wf, Wi) + I{Wi- Yf\Xl). 

Adding the above two equations we get 

Ri + R2 >I{Xf; Yf\Wf, Wi) + I{Wf; Yi\Xl) + ; Yi\Wf, Wi) + Yf\Xf), 
>I{Xf; Yf\Wf, Wi) + I{Wl; Y^\Wi) + I(Xf ; Y^\Wl W^) + I{Wi; Yf\Wf), 
=IiXf, Wi; y/K) + liXl, Wf; Yi\Wi), 

which violates equation ( |49e| ) and contradicts the assumption that (i2i,i?2) S ^2- Therefore, whenever 
{Ri,R2) G 7^2 but i?2) ^ '^hk(^s) oi^ly among ( |25b| ) and ( |25d| ) is violated and not both. 

Next we shall show that such a rate pair is always achievable by the explicit HK coding scheme. 
Suppose (i?i,i?2) £ but (-Ri,i?2) ^ ^hk(^s) because it violates equation (I25bb . i.e., 

i?i >I(Xf ; y/l , + I(VFf ; (69) 
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From Definition |4] we know that in this case, Txi transmits only its private message, i.e., the coding 
scheme 7ilC{{-^lMi,0, K2u, K2w}) is used. To distinguish the codewords of the first user in this case 
to those used in the simple HK scheme we use the following notations: 

Xf = uf ^ CATiO, ^/mJ, Wf = Wi = Wl and i7| = [/|, (70) 

Ml 

are mutually independent and = ^2 + ^1' where y.^'s are the outputs of the channel (e.g., see 
equations (|2]l) when 's are the inputs. Evidently, if the joint distribution of these random vectors be 
denoted by P,^ (Xf , f/f , TVf , , then Ps,{.) e V* . Putting P* = P.,, in the expression for 

7^^(P*) we get the achievable region of the coding scheme 'H}C{{jj^Imi,0, K2u, K2w}) as 

^hk(^.J = {(«i,^2) : ^1 <IiXf;Yf\W^); 

R2 <IiXl-Yi)- 

R2 mWl-,Yf\Xl)+I{^-Yi\Wl)-, 
Ri + R2 <IiXf, Wi- Yf) + im-Yi\Wi)} 
which can easily be shown to be equivalent to 

^hk(^^J = {(^i>^2) : Ri <I{XhYf\Wi)- (71) 

R2<I{Xl-Yi)- (72) 

R2 <I{Wl;Yf\Xl) + I{X3^-Yi\Wi)- (73) 

Ri + R2 <IiXf, Wi; Yf) + y/|VF|)} (74) 

using the relations between Vf's and Vf's defined earlier, where V G {U,W, X,Y}. In what follows, 
we shall shown that i?2) G ^HK(-fsi) and hence achievable by T-LIC. 
From (I49al) we know that 

Ri <I{Xf;Yf\Wi). (75) 

R2 <I{Xl;Yi). (76) 

R2 </(VF|; Yf\Wf) + I(X|; r/|Ty|); (77) 
</(l^|; Yf\Xf) + /(Xf; (78) 



From (|69l ) and (|49d| ) we obtain 



From (|69l ) and (|49e| ) we obtain 
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and from ^ and (HIB 

Ri + R2 <I{Xf, Wl- Yf) + /(Xf ; Yi\Wl). (79) 

This proves that {Ri,R2) € TZ^^{Ps^) and hence achievable by UK,. 
The other case when {Ri, R2) G 7^2 but equation (I25db is violated, i.e., 

i?2 >/(^f ; il'lH^f , + I{Wl; Yf\Xl), (80) 

can be similarly proved. Again from Definition 51 we know that in this case the coding scheme T-LK, 

is used. Defining 

If = Ul ~ CAA(0, -L/j^.J, #1 = VFf = Wl (If = Ul (81) 
M2 

and = wf + C/f, where y.^'s are the outputs of the channel (e.g., see equations (|2]i) when Xf's, are 
the inputs, it is clear that the joint distributions of these variables, Pg^ (Xf , (jf, Wf,Xl, Ul W^) £ V* . 
Finally, putting P* = Pg^ in the expression for Tl^^{P*) the achievable region of the coding scheme 
^{^.{{Kiu, Kiw, Ti^^Ma, 0}) can be computed. Now, combining equation ([8OI ) and (l49l ) in the same way 
as the previous case, it can be shown that R2) € Tl^^{Ps2) and is hence achievable by HIC. 

Finally, if {Ri,R2) € TZ^^{Ps), then by Lemma[3]we know that TilCdKiu, Ki^, K2u, K2w}) or the 
simple HK scheme can achieve this rate pair. 

Appendix E 
Proof of Lemma |7] 

We shall prove this lemma in two steps. In step one, we shall prove 
where T-L = {-f^n, i^2i' ^i2' -^22} second step we shall prove that 

Clearly, the above two equalities prove the lemma. 

Stepl: Let us consider the interference channel TC p^^ . Following a similar method as in Lemma [T] 
we can derive an upper bound to the capacity region of this IC. Let the corresponding bounds of 
TZ^ (i-L, p''^ be denoted by 1 < /c < 7. In what follows, we shall first prove that /ft3 = II, 1^ = 
and Ibk = 11 for k G {1,2,5,6,7}. 
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Towards proving the first equality, from equation © we get 

h3 = log det (ijv. + PuHuHI^ + P22 ^22^^12) + log det (^In, + PiiHu (^hi, + Pi2H\^Hi'^ ^ 
= log det (^/at, + P12H12HI2 + P22H22H12^ + log det (^Im^ + PuHI^H^ + pnHl^Hu 



-log det (^Im, + PuHl^Huj , 

logdet(^JAr2 + P12H12HI2 + P22H22HI^ + log det {Imi + Pl2-H'l2-f^l2 + PiiH\^Hii 

-log det {In^ + puHuHI^^ , [•.- logdet(/ + ^B) = logdet(/ + BA)] 
log det (im, + P22HI2 (In, + P12H12HI2) ^ H22] + log det (Im, + PuHI^Hu + puHl-^Hu 



J- A ■ 



Similarly, it can be proved that 1^4 = /g. The equality of the first two bounds follow trivially from the 
identity logdet(/ + AB) = logdet(/ + BA). Now, towards proving the fifth bound we see 



45(1) = logdet(/jVi + P2iH2iHI^ + piiHuKiHl^^ 

= log det f /tVi + Pu [In, + P2iH2iHI^ ^ HnKiHl^ ] + log det (^In, + 

P21-f^21-f^2l) 



log det (^Im, + piii^li (/at, + P2iH2iHI^ HuK^ + log det (^hi^ + ^21^^21 ^21 
logdetfiTf 1 + piiH\^ (In, + P21H21HIX' Hu) + logdet(A'i) + log det {K^') 



= log det (^Im, + P12HI2H12 + puHI^ (^In, + P2iH2iHI^ ^ Hu 
+ log det (iTi) - log det (7^2)- 
Similarly, it can be easily proved that 

/fe5(2) = log det (^/jv, + P12H12HI2 + P22H22K2HI2 

= log det (^M2 + P2iHI^H2i + P22HI2 {In2 + ^12^^12^12) ^22 
+ log det(K2) - log det {Ki) . 
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Combining the last two equations we get 



45 =45(1) + 45(2), 






Proving the equality of the other two bounds is similar. Hence, the set upper bounds for the capacity 
region of IC ii-L,p''\ defines the same set of rate pairs as IZ^ {H^p). 



Stepl: Suppose 5 is a p.s.d. matrix and 5* represents its complex conjugate, i.e., the matrix obtained by 
replacing all its entries by the corresponding complex conjugates. Then, using the fact that its eigen-values 
are real, it can be easily be proved that 



However, note that all the terms in the different bounds of Lemma [T] are of the form just described. 
This in turn proves that if we replace all the channel matrices of a 2-user MIMO IC by their complex 
conjugates the set of upper bounds remain the same. From this fact, it easily follows that 
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